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PLANT DEVELOPMENTAL GENES 



RELATED APPLICATION INFORMATION 
The present invention claims the benefit from US Provisional Patent Application Serial 

Nos. 60/166,228 filed November 17, 1999 and 60/197,899 filed April 17, 2000 and "Plant Trait 

Modification ni" filed August 22, 2000. 

FIELD OF THE INVENTION 

This invention relates to the field of plant biology. More particularly, the present 

invention pertains to compositions and methods for phenotypically modifying a plant. 

BACKGROUND OF THE INVENTION 

Transcription factors can modulate gene expression, either increasing or 

decreasing (inducing or repressing) the rate of transcription. This modulation results in 
differential levels of gene expression at various developmental stages, in different tissues and cell 
types, and in response to different exogenous (e.g., environmental) and endogenous stimuli 
throughout the life cycle of the organism. 

- Because transcription factors are key controlling elements of biological 
pathways, altering the expression levels of one or more transcription factors can change entire 
biological pathways in an organism. For example, manipulation of the levels of selected 
transcription factors may result in increased expression of economically useful proteins or 
metabolic chemicals in plants or to improve other agriculturally relevant characteristics. 
Conversely, blocked or reduced expression of a transcription factor may reduce biosynthesis of 
unwanted compounds or remove an undesirable trait. Therefore, manipulating transcription 
factor levels in a plant offers tremendous potential in agricultural biotechnology for modifying a 
plant's traits. 

The present invention provides novel transcription factors useful for modifying a 
plant's phenotype in desirable ways, such as modifying a plant's structure or development. 

SUMMARY OF THE INVENTION 

In a first aspect, the invention relates to a recombinant polynucleotide comprising 

a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding a 
polypeptide comprising a sequence selected from SEQ ID Nos. 2N, where N=l-23, or a 
complementary nucleotide sequence thereof; (b) a nucleotide sequence encoding a polypeptide 
comprising a conservatively substituted variant of a polypeptide of (a); (c) a nucleotide sequence 
comprising a sequence selected from those of SEQ ID Nos. 2N-1, where N=l-23, or a 
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complementary nucleotide sequence thereof; (d) a nucleotide sequence comprising silent 
substitutions in a nucleotide sequence of (c); (e) a nucleotide sequence which hybridizes under 
stringent conditions over substantially the entire length of a nucleotide sequence of one or more 
of: (a), (b), (c), or (d); (f) a nucleotide sequence comprising at least 15 consecutive nucleotides of 
5 a sequence of any of (a)-(e); (g) a nucleotide sequence comprising a subsequence or fragment of 
any of (a)-(f), which subsequence or fragment encodes a polypeptide having a biological activity 
that modifies a plant's structure and development characteristics; (h) a nucleotide sequence 
having at least 3 1% sequence identity to a nucleotide sequence of any of (a)-(g); (i) a nucleotide 
sequence having at least 60% identity sequence identity to a nucleotide sequence of any of (a)- 

10 (g)? G) ^ nucleotide sequence which encodes a polypeptide having at least 31% identity sequence 
identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; (k) a nucleotide sequence which 
encodes a polypeptide having at least 60% identity sequence identity to a polypeptide of SEQ ID 
Nos. 2N, where N=l-23; and (1) a nucleotide sequence which encodes a conserved domain of a 
polypeptide having at least 65% sequence identity to a conserved domain of a polypeptide of 

15 SEQ ID Nos. 2N, where N=l-23. The recombinant polynucleotide may further comprise a 

constitutive, inducible, or tissue-active promoter operably linked to the nucleotide sequence. The 
invention also relates to compositions comprising at least two of the above described 
polynucleotides. 

In a second aspect, the invention is an isolated or recombinant polypeptide 
20 comprising a subsequence of at least about 15 contiguous amino acids encoded by the 
recombinant or isolated polynucleotide described above. 

In another aspect, the invention is a transgenic plant comprising one or more of 
the above described recombinant polynucleotides. In yet another aspect, the invention is a plant 
with altered expression levels of a polynucleotide described above or a plant with altered 
25 expression or activity levels of an above described polypeptide. Further, the invention is a plant 
lacking a nucleotide sequence encoding a polypeptide described above. The plant may be a 
soybean, wheat, com, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, 
cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
30 pineapple, spinach, squash, sweet com, tobacco, tomato, watermelon, rosaceous fruits, or 
vegetable brassicas plant. 

In a fiirther aspect, the invention relates to a cloning or expression vector 
comprising the isolated or recombinant polynucleotide described above or cells comprising the 
cloning or expression vector. 
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In yet a further aspect, the invention relates to a composition produced by 
incubating a polynucleotide of the invention with a nuclease, a restriction enzyme, a polymerase; 
a polymerase and a primer; a cloning vector, or with a cell. 

Furthermore, the invention relates to a method for producing a plant having 
5 modified structure and development traits. The method comprises altering the expression of an 
isolated or recombinant polynucleotide of the invention or altering the expression or activity of a 
polypeptide of the invention in a plant to produce a modified plant, and selecting the modified 
plant for modified structure and development traits. 

In another aspect, the invention relates to a method of identifying a factor that is 
10 modulated by or interacts with a polypeptide encoded by a polynucleotide of the invention. The 
method comprises expressing a polypeptide encoded by the polynucleotide in a plant; and 
identifying at least one factor that is modulated by or interacts with the polypeptide. In one 
embodiment the method for identifying modulating or interacting factors is by detecting binding 
by the polypeptide to a promoter sequence, or by detecting interactions between an additional 
15 protein and the polypeptide in a yeast two hybrid system, or by detecting expression of a factor by 
hybridization to a microarray, subtractive hybridization or differential display. 

In yet another aspect, the invention is a method of identifying a molecule that 
modulates activity or expression of a polynucleotide or polypeptide of interest. The method 
comprises placing the molecule in contact with a plant comprising the polynucleotide or 
20 polypeptide encoded by the polynucleotide of the invention and monitoring one or more of the 
expression level of the polynucleotide in the plant, the expression level of the polypeptide in the 
plant, and modulation of an activity of the polypeptide in the plant. 

In yet another aspect, the invention relates to an integrated system, computer or 
computer readable medium comprising one or more character strings corresponding to a 
25 polynucleotide of the invention, or to a polypeptide encoded by the polynucleotide. The 

integrated system, computer or computer readable medium may comprise a link between one or 
more sequence strings to a modified plant structure and development trait. 

In yet another aspect, the invention is a method for identifying a sequence similar 
or homologous to one or more polynucleotides of the invention, or one or more pol3q3eptides 
30 encoded by the polynucleotides. The method comprises providing a sequence database; and, 
querying the sequence database with one or more target sequences corresponding to the one or 
more polynucleotides or to the one or more pol3^eptides to identify one or more sequence 
members of the database that display sequence similarity or homology to one or more of the one 
or more target sequences. 
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The method may further comprise of linking the one or more of the 
polynucleotides of the invention, or encoded polypeptides, to a modified plant structure and 
development characteristics phenotype. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Figure I provides a table of exemplary polynucleotide and polypeptide sequences of the 

invention. The table includes from left to right for each sequence: the SEQ ID No., the internal 
code reference number (GID), whether the sequence is a polynucleotide or polypeptide sequence, 
and identification of any conserved domains for the polj^jeptide sequences. 

Figure 2 provides a table of exemplary sequences that are homologous to other sequences 
10 provided in the Sequence Listing and that are derived from Arabidopsis thaliana. The table 
includes from left to right: the SEQ ID No., the internal code reference nimiber (GID), 
identification of the homologous sequence, whether the sequence is a polynucleotide or 
polypeptide sequence, and identification of any conserved domains for the polypeptide 
sequences. 

15 Figure 3 provides a table of exemplary sequences that are homologous to the sequences 

provided in Figures 1 and 2 and that are derived from plants other than Arabidopsis thaliana. The 
table includes from left to right: the SEQ ID No., the internal code reference number (GID), the 
unique GenBank sequence ID No. (NID), the probability that the comparison was generated by 
chance (P-value), and the species from which the homologous gene was identified. 

20 



DETAILED DESCRIPTION 
The present invention relates to polynucleotides and polypeptides, e.g. for 

modifying phenotypes of plants. 

In particular, the polynucleotides or polypeptides are useful for modifying traits 

25 associated with a plant's structure or development characteristics when the expression levels of 
the pol)niucleotides or expression levels or activity levels of the polypeptides are altered. 
Specifically, the polynucleotides and polypeptides are useful for modifying the structure and size 
of flowers, leaves, roots, the plant as a whole, or the like, apical dominance, branching patterns, 
number of organs, organ identity, whether a plant is sterile or not, the vascularization of a plant, 

30 or the developmental staging of a plant, such as when senescence is triggered. 

The polynucleotides of the invention encode plant transcription factors. The plant 
transcription factors are derived, e.g., from Arabidopsis thaliana and can belong, e.g., to one or 
more of the following transcription factor families: the AP2 (APETALA2) domain transcription 
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factor family (Riechmann and Meyerowitz (1998) J. Biol. Chem. 379:633-646); the MYB 
transcription factor family (Martin and Paz- Ares (1997) Trends Genet. 13:67-73); the MADS 
domain transcription factor family (Riechmann and Meyerowitz (1997) J. Biol. Chem. 378:1079- 
1 101); the WRKY protein family (Ishiguro and Nakamura (1994) Mol. Gen. Genet. 244:563- 
5 571); the ankyrin-repeat protein family (Zhang et al. (1992) Plant Cell 4: 1575-1588); the 

miscellaneous protein (MISC) family (Kim et al. (1997) Plant J. 1 1:1237-1251); the zinc finger 
protein (Z) family (Klug and Schwabe (1995) FASEB J. 9: 597-604); the homeobox (HB) protein 
family (Duboule (1994) Guidebook to the Homeobox Genes. Oxford University Press); the 
CAAT-element binding proteins (Forsburg and Guarente (1989) Genes Dev. 3:1 166-1 178); the 

10 squamosa promoter binding proteins (SPB) (Klein et al. (1996) Mol. Gen. Genet. 1996 250:7-16); 
the NAM protein family; the lAA/AUX proteins (Rouse et al. (1998) Science 279:1371-1373); 
the HLH/MYC protein family (Littlewood et al. (1994) Prot. Profile 1 :639-709); the DNA- 
binding protein (DBP) family (Tucker et al. (1994) EMBO J. 13:2994-3002); the bZIP family of 
transcription factors (Foster et al. (1994) FASEB J. 8: 192-200); the BPF-1 protein (Box P- 

15 binding factor) family (da Costa e Silva et al. (1993) Plant J. 4:125-135); and the golden protein 
(GLD) family (Hall et al. (1998) Plant Cell 10:925-936). 

In addition to methods for modifying a plant phenotype by employing one or 
more polynucleotides and polypeptides of the invention described herein, the polynucleotides 
and polypeptides of the invention have a variety of additional uses. These uses include their use 

20 in the recombinant production (i.e, expression) of proteins; as regulators of plant gene expression, 
as diagnostic probes for the presence of complementary or partially complementary nucleic acids 
(including for detection of natural coding nucleic acids); as substrates for further reactions, e.g., 
mutation reactions, PCR reactions, or the like, of as substrates for cloning e.g., including 
digestion or ligation reactions, and for identifying exogenous or endogenous modulators of the 

25 transcription factors. 

DEFINITIONS 

A "polynucleotide" is a nucleic acid sequence comprising a plurality of 
polymerized nucleotide residues, e.g., at least about 15 consecutive polymerized nucleotide 
residues, optionally at least about 30 consecutive nucleotides, at least about 50 consecutive 
30 nucleotides. In many instances, a polynucleotide comprises a nucleotide sequence encoding a 
polypeptide (or protein) or a domain or fragment thereof. Additionally, the polynucleotide may 
comprise a promoter, an intron, an enhancer region, a polyadenylation site, a translation initiation 
site, 5' or 3' untranslated regions, a reporter gene, a selectable marker, or the like. The 
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polynucleotide can be single stranded or double stranded DNA or RNA. The polynucleotide 
optionally comprises modified bases or a modified backbone. The polynucleotide can be, e.g., 
genomic DNA or RNA, a transcript (such as an mRNA), a cDNA, a PGR product, a cloned DNA, 
a synthetic DNA or RNA, or the like. The polynucleotide can comprise a sequence in either 
5 sense or antisense orientations. 

A "recombinant polynucleotide" is a polynucleotide that is not in its native state, 
e.g., the polynucleotide comprises a nucleotide sequence not found in nature, or the 
polynucleotide is in a context other than that in which it is naturally found, e.g., separated from 
nucleotide sequences with which it typically is in proximity in nature, or adjacent (or contiguous 
10 with) nucleotide sequences with which it typically is not in proximity. For example, the sequence 
at issue can be cloned into a vector, or otherwise recombined with one or more additional nucleic 
acid. 

An "isolated polynucleotide" is a polynucleotide whether naturally occurring or 
recombinant, that is present outside the cell in which it is typically found in nature, whether 

15 purified or not. Optionally, an isolated polynucleotide is subject to one or more enrichment or 
purification procedures, e.g., cell lysis, extraction, centrifugation, precipitation, or the like. 

A "recombinant polypeptide" is a polypeptide produced by translation of a 
recombinant polynucleotide. An "isolated polypeptide," whether a naturally occurring or a 
recombinant polypeptide, is more enriched in (or out of) a cell than the polypeptide in its natural 

20 state in a wild type cell, e.g., more than about 5% enriched, more than about 10% enriched, or 
more than about 20%, or more than about 50%, or more, enriched, i.e., alternatively denoted: 
105%, 110%), 120%>, 150% or more, enriched relative to wild type standardized at 100%>. Such an 
enrichment is not the result of a natural response of a wild type plant. Alternatively, or 
additionally, the isolated polypeptide is separated from other cellular components with which it is 

25 typically associated, e.g., by any of the various protein purification methods herein. 

The term "transgenic plant" refers to a plant that contains genetic material, not 
found in a wild type plant of the same species, variety or cultivar. The genetic material may 
include a transgene, an insertional mutagenesis event (such as by transposon or T-DNA 
insertional mutagenesis), an activation tagging sequence, a mutated sequence, a homologous 

30 recombination event or a sequence modified by chimeraplasty. Typically, the foreign genetic 
material has been introduced into the plant by human manipulation, 

A transgenic plant may contain an expression vector or cassette. The expression 
cassette typically comprises a polypeptide-encoding sequence operably linked (i.e., under 
regulatory control of) to appropriate inducible or constitutive regulatory sequences that allow for 



6 



wo 01/36444 



PCT/USOO/31325 



the expression of polypeptide. The expression cassette can be introduced into a plant by 
transformation or by breeding after transformation of a parent plant. A plant refers to a whole 
plant as well as to a plant part, such as seed, fruit, leaf, or root, plant tissue, plant cells or any 
other plant material, e.g., a plant explant, as well as to progeny thereof, and to in vitro systems 
5 that mimic biochemical or cellular components or processes in a cell. 

The phrase "ectopically expression or altered expression" in reference to a 
polynucleotide indicates that the pattern of expression in, e.g., a transgenic plant or plant tissue, is 
different from the expression pattern in a wild type plant or a reference plant of the same species. 
For example, the polynucleotide or polypeptide is expressed in a cell or tissue type other than a 

10 cell or tissue type in which the sequence is expressed in the wild type plant, or by expression at a 
time other than at the time the sequence is expressed in the wild type plant, or by a response to 
different inducible agents, such as hormones or environmental signals, or at different expression 
levels (either higher or lower) compared with those found in a wild type plant. The term also 
refers to altered expression patterns that are produced by lowering the levels of expression to 

15 below the detection level or completely abolishing expression. The resulting expression pattern 
can be transient or stable, constitutive or inducible, hi reference to a polypeptide, the term 
"ectopic expression or altered expression" further may relate to altered activity levels resulting 
from the interactions of the polypeptides with exogenous or endogenous modulators or from 
interactions with factors or as a result of the chemical modification of the polypeptides. 

20 The term "fragment" or "domain," with respect to a polypeptide, refers to a 

subsequence of the polypeptide. In some cases, the fragment or domain, is a subsequence of the 
polypeptide which performs at least one biological function of the intact polypeptide in 
substantially the same manner, or to a similar extent, as does the intact polypeptide. For example, 
a polypeptide fragment can comprise a recognizable structural motif or functional domain such as 

25 a DNA binding domain that binds to a DNA promoter region, an activation domain or a domain 
for protein-protein interactions. Fragments can vary in size from as few as 6 amino acids to the 
frill length of the intact polypeptide, but are preferably at least about 30 amino acids in length and 
more preferably at least about 60 amino acids in length. In reference to a nucleotide sequence, "a 
fragment" refers to any subsequence of a polynucleotide, typically, of at least consecutive about 

30 15 nucleotides, preferably at least about 30 nucleotides, more preferably at least about 50, of any 
of the sequences provided herein. 

The term "trait" refers to a physiological, morphological, biochemical or physical 
characteristic of a plant or particular plant material or cell. In some instances, this characteristic 
is visible to the human eye, such as seed or plant size, or can be measured by available 
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biochemical techniques, such as the protein, starch or oil content of seed or leaves or by the 
observation of the expression level of genes, e.g., by employing Northern analysis, RT-PCR, 
microarray gene expression assays or reporter gene expression systems, or by agricultural 
observations such as stress tolerance, yield or pathogen tolerance. 
5 "Trait modification" refers to a detectable difference in a characteristic in a plant 

ectopically expressing a polynucleotide or polypeptide of the present invention relative to a plant 
not doing so, such as a wild type plant. In some cases, the trait modification can be evaluated 
quantitatively. For example, the trait modification can entail at least about a 2% increase or 
decrease in an observed trait (difference), at least a 5% difference, at least about a 10% 

10 difference, at least about a 20% difference, at least about a 30%, at least about a 50%, at least 

about a 70%, or at least about a 100%, or an even greater difference. It is known that there can be 
a natural variation in the modified trait. Therefore, the trait modification observed entails a 
change of the normal distribution of the trait in the plants compared with the distribution 
observed in wild type plant. 

15 Trait modifications of particular interest include those to seed ( such as embryo 

or endosperm), finiit, root, flower, leaf, stem, shoot, seedling or the like, including: enhanced 
tolerance to environmental conditions including fi*eezing, chilling, heat, drought, water saturation, 
radiation and ozone; improved tolerance to microbial, fungal or viral diseases; improved 
tolerance to pest infestations, including nematodes, mollicutes, parasitic higher plants or the like; 

20 decreased herbicide sensitivity; improved tolerance of heavy metals or enhanced ability to take up 
heavy metals; improved growth under poor photoconditions (e.g., low light and/or short day 
length), or changes in expression levels of genes of interest. Other phenotype that can be 
modified relate to the production of plant metabolites, such as variations in the production of 
taxol, tocopherol, tocotrienol, sterols, phytosterols, vitamins, wax monomers, anti-oxidants, 

25 amino acids, lignins, cellulose, tannins, prenyllipids (such as chlorophylls and carotenoids), 
glucosinolates, and terpenoids, enhanced or compositionally altered protein or oil production 
(especially in seeds), or modified sugar (insoluble or soluble) and/or starch composition. 
Physical plant characteristics that can be modified include cell development (such as the number 
of trichomes), finit and seed size and number, yields of plant parts such as stems, leaves and 

30 roots, the stability of the seeds during storage, characteristics of the seed pod (e.g., susceptibility 
to shattering), root hair length and quantity, intemode distances, or the quality of seed coat. Plant 
growth characteristics that can be modified include growth rate, germination rate of seeds, vigor 
of plants and seedlings, leaf and flower senescence, male sterility, apomixis, flowering time, 
flower abscission, rate of nitrogen uptake, biomass or transpiration characteristics, as well as 

8 



wo 01/36444 



PCT/USOO/31325 



plant architecture characteristics such as apical dominance, branching patterns, number of organs, 
organ identity, organ shape or size. 

POLYPEPTIDES AND POLYNUCLEOTIDES OF THE INVENTION 

The present invention provides, among other things, transcription factors (TPs), 
5 and transcription factor homologue polypeptides, and isolated or recombinant polynucleotides 
encoding the polypeptides. These polypeptides and polynucleotides may be employed to modify 
a plant's structure and development characteristics. 

Exemplary polynucleotides encoding the polypeptides of the invention were 
identified in the Arabidopsis thaliana GenBank database using publicly available sequence 
10 analysis programs and parameters. Sequences initially identified were then further characterized 
to identify sequences comprising specified sequence strings corresponding to sequence motifs 
present in families of known transcription factors. Polynucleotide sequences meeting such 
criteria were confirmed as transcription factors. 

Additional polynucleotides of the invention were identified by screening 
15 Arabidopsis thaliana and/or other plant cDNA libraries with probes corresponding to known 
transcription factors under low stringency hybridization conditions. Additional sequences, 
including full length coding sequences were subsequently recovered by the rapid amplification of 
cDNA ends (RACE) procedure, using a commercially available kit according to the 
manufacturer's instructions. Where necessary, multiple rounds of RACE are performed to isolate 
20 5' and 3' ends. The fiiU length cDNA was then recovered by a routine end-to-end polymerase 

chain reaction (PCR) using primers specific to the isolated 5' and 3' ends. Exemplary sequences 
are provided in the Sequence Listing. 

The polynucleotides of the invention were ectopically expressed in overexpressor 
or knockout plants and changes in the structure and development characteristics of the plants 
25 were observed. Therefore, the polynucleotides and polypeptides can be employed to improve the 
structure and development characteristics of plants. 

Making polynucleotides 

The polynucleotides of the invention include sequences that encode transcription 
factors and transcription factor homologue polypeptides and sequences complementary thereto, as 
30 well as unique fragments of coding sequence, or sequence complementary thereto. Such 

polynucleotides can be, e.g., DNA or RNA, e.g., mRNA, cRNA, synthetic RNA, genomic DNA, 
cDNA synthetic DNA, oligonucleotides, etc. The polynucleotides are either double-stranded or 
single-stranded, and include either, or both sense (i.e., coding) sequences and antisense (i.e., non- 
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coding, complementary) sequences. The polynucleotides include the coding sequence of a 
transcription factor, or transcription factor homologue polypeptide, in isolation, in combination 
with additional coding sequences (e.g., a purification tag, a localization signal, as a fusion- 
protein, as a pre-protein, or the like), in combination with non-coding sequences (e.g., introns or 
5 inteins, regulatory elements such as promoters, enhancers, terminators, and the like), and/or in a 
vector or host environment in which the polynucleotide encoding a transcription factor or 
transcription factor homologue polypeptide is an endogenous or exogenous gene. 

A variety of methods exist for producing the polynucleotides of the invention. 
Procedures for identifying and isolating DNA clones are well known to those of skill in the art, 
10 and are described in, e.g., Berger and Kimmel, Guide to Molecular Cloning Techniques. Methods 
in Enzymologv volume 152 Academic Press, Inc., San Diego, CA ("Berger"); Sambrook et al.. 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol, 1-3, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York, 1989 ("Sambrook") and Current Protocols in Molecular Biology . 
F.M. Ausubel et al., eds.. Current Protocols, a joint venture between Greene Publishing 
15 Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 2000) ("Ausubel"). 

Alternatively, polynucleotides of the invention, can be produced by a variety of 
in vitro amplification methods adapted to the present invention by appropriate selection of 
specific or degenerate primers. Examples of protocols sufficient to direct persons of skill through 
in vitro amplification methods, including the polymerase chain reaction (PCR) the ligase chain 
20 reaction (LCR), Qbeta-replicase amplification and other RNA polymerase mediated techniques 
(e.g., NASBA), e.g., for the production of the homologous nucleic acids of the invention are 
found in Berger, Sambrook, and Ausubel, as well as Mullis et al., (1987) PCR Protocols A Guide 
to Methods and Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis), 
Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., 
25 U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are 

summarized in Cheng et al. (1994) Nature 369: 684-685 and the references cited therein, in which 
PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any 
RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR 
expansion and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, 
30 Sambrook and Berger, all supra. 

Alternatively, polynucleotides and oligonucleotides of the invention can be 
assembled from fragments produced by solid-phase synthesis methods. Typically, fragments of 
up to approximately 100 bases are individually synthesized and then enzymatically or chemically 
ligated to produce a desired sequence, e.g., a polynucletotide encoding all or part of a 
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transcription factor. For example, chemical synthesis using the phosphoramidite method is 
described, e.g., by Beaucage et al. (1981) Tetrahedron Letters 22:1859-69; and Matthes et al. 
(1984) EMBO J. 3:801-5. According to such methods, oligonucleotides are synthesized, purified, 
annealed to their complementary strand, ligated and then optionally cloned into suitable vectors. 
5 And if so desired, the polynucleotides and polypeptides of the invention can be custom ordered 
from any of a number of commercial suppliers. 

HOMOLOGOUS SEQUENCES 

Sequences homologous, i.e., that share significant sequence identity or similarity, 
to those provided in the Sequence Listing, derived from Arabidopsis thaliana or from other plants 

10 of choice are also an aspect of the invention. Homologous sequences can be derived from any 
plant including monocots and dicots and in particular agriculturally important plant species, 
including but not limited to, crops such as soybean, wheat, com, potato, cotton, rice, oilseed rape 
(including canola), sunflower, alfalfa, sugarcane and turf; or fiiiits and vegetables, such as 
banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, 

15 cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
pineapple, spinach, squash, sweet com, tobacco, tomato, watermelon, rosaceous fruits (such as 
apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, 
cauliflower, bmssel sprouts and kohlrabi). Other crops, fruits and vegetables whose phenotype 
can be changed include barley, rye, millet, sorghum, currant, avocado, citms fruits such as 

20 oranges, lemons, grapefruit and tangerines, artichoke, cherries, nuts such as the walnut and 
peanut, endive, leek, roots, such as arrowroot, beet, cassava, tumip, radish, yam, and sweet 
potato, and beans. The homologous sequences may also be derived from woody species, such 
pine, poplar and eucalyptus. 

Transcription factors that are homologous to the listed sequences will typically 

25 share at least about 30% amino acid sequence identity. More closely related transcription factors 
can share at least about 50%, about 60%, about 65%, about 70%, about 75% or about 80% or 
about 90% or about 95% or about 98% or more sequence identity with the listed sequences. 
Factors that are most closely related to the listed sequences share, e.g., at least about 85%, about 
90% or about 95% or more % sequence identity to the listed sequences. At the nucleotide level, 

30 the sequences will typically share at least about 40% nucleotide sequence identity, preferably at 
least about 50%, about 60%, about 70% or about 80% sequence identity, and more preferably 
about 85%, about 90%, about 95% or about 97% or more sequence identity to one or more of the 
listed sequences. The degeneracy of the genetic code enables major variations in the nucleotide 
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sequence of a polynucleotide while maintaining the amino acid sequence of the encoded protein. 
Conserved domains within a transcription factor family may exhibit a higher degree of sequence 
homology, such as at least 65% sequence identity including conservative substitutions, and 
preferably at least 80% sequence identity. 

5 Identifying Nucleic Acids by Hybridization 

Polynucleotides homologous to the sequences illustrated in the Sequence Listing 

can be identified, e.g., by hybridization to each other under stringent or under highly stringent 

conditions. Single stranded polynucleotides hybridize when they associate based on a variety of 

well characterized physico-chemical forces, such as hydrogen bonding, solvent exclusion, base 

10 stacking and the like. The stringency of a hybridization reflects the degree of sequence identity 
of the nucleic acids involved, such that the higher the stringency, the more similar are the two 
polynucleotide strands. Stringency is influenced by a variety of factors, including temperature, 
salt concentration and composition, organic and non-organic additives, solvents, etc. present in 
both the hybridization and wash solutions and incubations (and number), as described in more 

1 5 detail in the references cited above. 

An example of stringent hybridization conditions for hybridization of 
complementary nucleic acids which have more than 100 complementary residues on a filter in a 
Southern or northern blot is about S'^C to 20°C lower than the thermal melting point (Tm) for the 
specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined 

20 ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched 

probe. Nucleic acid molecules that hybridize under stringent conditions will typically hybridize 
to a probe based on either the entire cDNA or selected portions, e.g., to a unique subsequence, of 

the cDNA under wash conditions of 0.2x SSC to 2,0 x SSC, 0.1%) SDS at 50-65^ C, for example 

0.2 X SSC, 0,1% SDS at 65^ C. For identification of less closely related homologues washes can 

25 be performed at a lower temperature, e.g., 50° C. In general, stringency is increased by raising 
the wash temperature and/or decreasing the concentration of SSC. 

As another example, stringent conditions can be selected such that an 
oligonucleotide that is perfectly complementary to the coding oligonucleotide hybridizes to the 
coding oligonucleotide with at least about a 5-lOx higher signal to noise ratio than the ratio for 

30 hybridization of the perfectly complementary oligonucleotide to a nucleic acid encoding a 

transcription factor known as of the filing date of the application. Conditions can be selected 
such that a higher signal to noise ratio is observed in the particular assay which is used, e.g., 
about 15x, 25x, 35x, 50x or more. Accordingly, the subject nucleic acid hybridizes to the unique 
coding oligonucleotide with at least a 2x higher signal to noise ratio as compared to hybridization 
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of the coding oligonucleotide to a nucleic acid encoding known polypeptide. Again, higher 
signal to noise ratios can be selected, e.g., about 5x, lOx, 25x, 35x, 50x or more. The particular 
signal will depend on the label used in the relevant assay, e.g., a fluorescent label, a colorimetric 
label, a radio active label, or the like. 
5 Alternatively, transcription factor homologue polypeptides can be obtained by 

screening an expression library using antibodies specific for one or more transcription factors. 
With the provision herein of the disclosed transcription factor, and transcription factor homologue 
nucleic acid sequences, the encoded polypeptide(s) can be expressed and purified in a 
heterologous expression system (e.g., E. coll) and used to raise antibodies (monoclonal or 

10 polyclonal) specific for the polypeptide(s) in question. Antibodies can also be raised against 
synthetic peptides derived from transcription factor, or transcription factor homologue, amino 
acid sequences. Methods of raising antibodies are well known in the art and are described in 
Harlow and Lane (1988) Antibodies: A Laboratory Manual , Cold Spring Harbor Laboratory, New 
York. Such antibodies can then be used to screen an expression library produced from the plant 

15 from which it is desired to clone additional transcription factor homologues, using the methods 
described above. The selected cDNAs can be confirmed by sequencing and enzymatic activity. 

SEQUENCE VARL\TIONS 

It will readily be appreciated by those of skill in the art, that any of a variety of 

polynucleotide sequences are capable of encoding the transcription factors and transcription 
20 factor homologue polypeptides of the invention. Due to the degeneracy of the genetic code, 

many different polynucleotides can encode identical and/or substantially similar polypeptides in 

addition to those sequences illustrated in the Sequence Listing. 

For example. Table 1 illustrates, e.g., that the codons AGC, AGT, TCA, TCC, 

TCG, and TCT all encode the same amino acid: serine. Accordingly, at each position in the 
25 sequence where there is a codon encoding serine, any of the above trinucleotide sequences can be 

used without altering the encoded polypeptide. 
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Table 1 



Amino acids 


Codon 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


TGC 


TGT 










Aspartic acid 


Asp 


D 


GAG 


GAT 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


TTC 


TTT 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGT 






Histidine 


His 


H 


CAC 


CAT 










Isoleucine 


ne 


I 


ATA 


ATC 


ATT 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


TTA 


TTG 


CTA 


CTC 


CTCj 


CTT 


Methionine 


Met 


M 


ATG 












Asparagine 


Asn 


N 


AAC 


AAT 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


CCT 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGT 


Serine 


Ser 


S 


AGC 


AGT 


TCA 


TCC 


TCG 


TCT 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACT 






Valine 


Val 


V 


GTA 


GTC 


GTG 


GTT 






Tryptophan 


Trp 


W 


TGG 












Tyrosine 


Tyr 


Y 


TAC 


TAT 











Sequence alterations that do not change the amino acid sequence encoded by the 
5 polynucleotide are termed "silent" variations. With the exception of the codons ATG and TGG, 
encoding methionine and tryptophan, respectively, any of the possible codons for the same amino 
acid can be substituted by a variety of techniques, e.g., site-directed mutagenesis, available in the 
art. Accordingly, any and all such variations of a sequence selected from the above table are a 
feature of the invention. 

10 hi addition to silent variations, other conservative variations that alter one, or a 

few amino acids in the encoded polypeptide, can be made without altering the function of the 
polypeptide, these conservative variants are, likewise, a feature of the invention. 

For example, substitutions, deletions and insertions introduced into the sequences 
provided in the Sequence Listing are also envisioned by the invention. Such sequence 

1 5 modifications can be engineered into a sequence by site -directed mutagenesis (Wu (ed.) Meth. 
Enzvmol . (1993) vol, 217, Academic Press) or the other methods noted below. Amino acid 
substitutions are typically of single residues; insertions usually will be on the order of about from 
1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues. In preferred 
embodiments, deletions or insertions are made in adjacent pairs, e.g., a deletion of two residues or 

20 insertion of two residues. Substitutions, deletions, insertions or any combination thereof can be 
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combined to arrive at a sequence. The mutations that are made in the polynucleotide encoding the 
transcription factor should not place the sequence out of reading frame and should not create 
complementary regions that could produce secondary mRNA structure. Preferably, the 
polypeptide encoded by the DNA performs the desired function. 

Conservative substitutions are those in which at least one residue in the amino 
acid sequence has been removed and a different residue inserted in its place. Such substitutions 
generally are made in accordance with the Table 2 when it is desired to maintain the activity of 
the protein. Table 2 shows amino acids which can be substituted for an amino acid in a protein 
and which are typically regarded as conservative substitutions. 

Table 2 



Residue 


Conservative Substitutions 


Ala 


Ser 


Arg 


Lys 


Asn 


Gin; His 


Asp 


Glu 


Gin 


Asn 


Cys 


Ser 


Glu 


Asp 


Gly 


Pro 


His 


Asn; Gin 


He 


Leu, Val 


Leu 


He; Val 


Lys 


Arg; Gin 


Met 


Leu; He 


Phe 


Met; Leu; Tyr 


Ser 


Thr; Gly 


Thr 


Ser;Val 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Substitutions that are less conservative than those in Table 2 can be selected by 
picking residues that differ more significantly in their effect on maintaining (a) the structure of 
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the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of 
the side chain. The substitutions which in general are expected to produce the greatest changes in 
protein properties will be those in which (a) a hydrophilic residue, e.g., seryl or threonyl, is 
5 substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or alanyl; 
(b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. 

10 FURTHER MODIFYING SEQUENCES OF THE INVENTION— MUTATION/ FORCED 
EVOLUTION 

In addition to generating silent or conservative substitutions as noted, above, the 
present invention optionally includes methods of modifying the sequences of the Sequence 
Listing. In the methods, nucleic acid or protein modification methods are used to alter the given 

1 5 sequences to produce new sequences and/or to chemically or enzymatically modify given 
sequences to change the properties of the nucleic acids or proteins. 

Thus, in one embodiment, given nucleic acid sequences are modified, e.g., 
according to standard mutagenesis or artificial evolution methods to produce modified sequences. 
For example, Ausubel, supra, provides additional details on mutagenesis methods. Artificial 

20 forced evolution methods are described, e.g., by Stemmer (1994) Nature 370:389-391, and 
Stemmer ( 1 994) Proc. Natl. Acad. Sci. USA 91:1 0747- 10751. Many other mutation and 
evolution methods are also available and expected to be within the skill of the practitioner. 

Similarly, chemical or enzymatic alteration of expressed nucleic acids and 
polypeptides can be performed by standard methods. For example, sequence can be modified by 

25 addition of lipids, sugars, peptides, organic or inorganic compounds, by the inclusion of modified 
nucleotides or amino acids, or the like. For example, protein modification techniques are 
illustrated in Ausubel, supra. Further details on chemical and enzymatic modifications can be 
found herein. These modification methods can be used to modify any given sequence, or to 
modify any sequence produced by the various mutation and artificial evolution modification 

30 methods noted herein. 

Accordingly, the invention provides for modification of any given nucleic acid 
by mutation, evolution, chemical or enzymatic modification, or other available methods, as well 
as for the products produced by practicing such methods, e.g., using the sequences herein as a 
starting substrate for the various modification approaches. 
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For example, optimized coding sequence containing codons preferred by a 
particular prokaryotic or eukaryotic host can be used e.g., to increase the rate of translation or to 
produce recombinant RNA transcripts having desirable properties, such as a longer half-life, as 
compared with transcripts produced using a non-optimized sequence. Translation stop codons 
5 can also be modified to reflect host preference. For example, preferred stop codons for S. 
cerevisiae and. mammals are TAA and TGA, respectively. The preferred stop codon for 
monocotyledonous plants is TGA, whereas insects and E. coli prefer to use TAA as the stop 
codon. 

The polynucleotide sequences of the present invention can also be engineered in 
10 order to alter a coding sequence for a variety of reasons, including but not limited to, alterations 
which modify the sequence to facilitate cloning, processing and/or expression of the gene 
product. For example, alterations are optionally introduced using techniques which are well 
known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter 
glycosylation pattems, to change codon preference, to introduce splice sites, etc. 
15 Furthermore, a fragment or domain derived from any of the polypeptides of the 

invention can be combined with domains derived from other transcription factors or synthetic 
domains to modify the biological activity of a transcription factor. For instance, a DNA binding 
domain derived from a transcription factor of the invention can be combined with the activation 
domain of another transcription factor or with a synthetic activation domain. A transcription 
20 activation domain assists in initiating transcription from a DNA binding site. Examples include 
the transcription activation region of VP 16 or GAL4 (Moore et al. (1998) Proc. Natl. Acad. Sci. 
USA 95: 376-381; and Aoyama et al, (1995) Plant Cell 7:1773-1785), peptides derived from 
bacterial sequences (Ma and Ptashne (1987) Cell 51; 1 13-119) and synthetic peptides (Giniger 
and Ptashne, (1987) Nature 330:670-672). 

25 EXPRESSION AND MODIFICATION OF POLYPEPTIDES 

Typically, polynucleotide sequences of the invention are incorporated into 
recombinant DNA (or RNA) molecules that direct expression of polypeptides of the invention in 
appropriate host cells, transgenic plants, in vitro translation systems, or the like. Due to the 
inherent degeneracy of the genetic code, nucleic acid sequences which encode substantially the 

30 same or a functionally equivalent amino acid sequence can be substituted for any listed sequence 
to provide for cloning and expressing the relevant homologue. 
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Vectors, Promoters and Expression Systems 

The present invention includes recombinant constructs comprising one or more 
of the nucleic acid sequences herein. The constructs typically comprise a vector, such as a 
plasmid, a cosmid, a phage, a virus (e.g., a plant virus), a bacterial artificial chromosome (BAG), 
5 a yeast artificial chromosome (YAC), or the like, into which a nucleic acid sequence of the 
invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are 
known to those of skill in the art, and are commercially available. 

10 General texts which describe molecular biological techniques useful herein, 

including the use and production of vectors, promoters and many other relevant topics, include 
Berger, Sambrook and Ausubel, supra. Any of the identified sequences can be incorporated into a 
cassette or vector, e.g., for expression in plants. A number of expression vectors suitable for stable 
transformation of plant cells or for the establishment of transgenic plants have been described 

15 including those described in Weissbach and Weissbach, (1989^ Methods for Plant Molecular 
Biology, Academic Press, and Gelvin et al., (1990) Plant Molecular Biologv Manual , Kluwer 
Academic Publishers. Specific examples include those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella et al. (1983) Nature 
303: 209, Bevan (1984) Nucl Acid Res. 12: 8711-8721, Klee (1985) Bio/Technologv 3: 637-642, 

20 for dicotyledonous plants. 

Alternatively, non-Ti vectors can be used to transfer the DNA into 
monocotyledonous plants and cells by using free DNA delivery techniques. Such methods can 
involve, for example, the use of liposomes, electroporation, microprojectile bombardment, silicon 
carbide whiskers, and viruses. By using these methods transgenic plants such as wheat, rice 

25 (Ghristou (1991) Bio/Technoloev 9: 957-962) and com (Gordon-Kamm (1990) Plant Cell 2: 603- 
618) can be produced. An immature embryo can also be a good target tissue for monocots for 
direct DNA delivery techniques by using the particle gun (Weeks et al. (1993) Plant Phvsiol 102: 
1077-1084; Vasil (1993) Bio/Technoloev 10: 667-674; Wan and Lemeaux (1994) Plant Phvsiol 
104: 37-48, and for Agrobacterium-mediated DNA transfer (Ishida et al. (1996) Nature Biotech 

30 14: 745-750). 

Typically, plant transformation vectors include one or more cloned plant coding 
sequence (genomic or cDNA) under the transcriptional control of 5* and 3* regulatory sequences 
and a dominant selectable marker. Such plant transformation vectors typically also contain a 
promoter (e.g., a regulatory region controlling inducible or constitutive, environmentally-or 
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developmentally-regulated, or cell- or tissue-specific expression), a transcription initiation start 
site, an RNA processing signal (such as intron splice sites), a transcription termination site, and/or 
a polyadenylation signal. 

Examples of constitutive plant promoters which can be useful for expressing the 
5 TF sequence include: the cauliflower mosaic virus (CaMV) 35 S promoter, which confers 
constitutive, high-level expression in most plant tissues {see, e.g., Odel et al. (1985) Nature 
313:810); the nopaline synthase promoter (An et al. (1988) Plant Phvsiol 88:547); and the 
octopine synthase promoter (Fromm et al. (1989) Plant Cell 1: 977). 

A variety of plant gene promoters that regulate gene expression in response to 

10 environmental, hormonal, chemical, developmental signals, and in a tissue-active manner can be 
used for expression of a TF sequence in plants. Choice of a promoter is based largely on the 
phenotype of interest and is determined by such factors as tissue (e.g., seed, fruit, root, pollen, 
vascular tissue, flower, carpel, etc.), inducibility (e.g., in response to wounding, heat, cold, 
drought, light, pathogens, etc.), timing, developmental stage, and the like. Numerous known 

1 5 promoters have been characterized and can favorable be employed to promote expression of a 
polynucleotide of the invention in a transgenic plant or cell of interest. For example, tissue 
specific promoters include: seed-specific promoters (such as the napin, phaseolin or DC3 
promoter described in US Pat. No. 5,773,697), fruit-specific promoters that are active during fi*uit 
ripening (such as the dm 1 promoter (US Pat. No. 5,783,393), or the 2A1 1 promoter (US Pat, No. 

20 4,943,674) and the tomato polygalacturonase promoter (Bird et al. (1988) Plant Mol Biol 1 1 :65 1), 
root-specific promoters, such as those disclosed in US Patent Nos. 5,618,988, 5,837,848 and 
5,905,186, pollen-active promoters such as PTA29, PTA26 and PTA13 (US Pat. No. 5,792,929), 
promoters active in vascular tissue (Ringli and Keller (1998) Plant Mol Biol 37:977-988), flower- 
specific (Kaiser et al, (1995) Plant Mol Biol 28:231-243), pollen (Baerson et al. (1994) Plant Mol 

25 Biol 26:1947-1959), carpels (Ohl et al. (1990) Plant Cell 2:837-848), pollen and ovules (Baerson 
et al. (1993) Plant Mol Biol 22:255-267), auxin-inducible promoters (such as that described in 
van der Kop et al. (1999) Plant Mol Biol 39:979-990 or Baumann et al. (1999) Plant Cell 1 1:323- 
334), cytokinin-inducible promoter (Guevara-Garcia (1998) Plant Mol Biol 38:743-753), 
promoters responsive to gibberellin (Shi et al. (1998) Plant Mol Biol 38:1053-1060, Willmott et 

30 al. (1998) 38:817-825) and the like. Additional promoters are those that elicit expression in 
response to heat (Ainley et al. (1993) Plant Mol Biol 22: 13-23), light (e.g., the pea rbcS-3A 
promoter, Kuhlemeier et al. (1989) Plant Cell 1:471, and the maize rbcS promoter, Schaffher and 
Sheen (1991) Plant Cell 3: 997); wounding (e.g., wuni, Siebertz et al. (1989) Plant Cell 1: 961); 
pathogens (such as the PR-1 promoter described in Buchel et al. (1999) Plant Mol. Biol. 40:387- 
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396, and the PDF1.2 promoter described in Manners et al. (1998) Plant Mol. BioL 38:1071-80), 
and chemicals such as methyl jasmonate or salicylic acid (Gatz et al. (1997) Plant Mol Biol 48: 89- 
108). In addition, the timing of the expression can be controlled by using promoters such as those 
acting at senescence (An and Amazon (1995) Science 270: 1986-1988); or late seed development 
5 (Odell et al. (1994) Plant Physiol 106:447-458). 

Plant expression vectors can also include RNA processing signals that can be 
positioned within, upstream or downstream of the coding sequence. In addition, the expression 
vectors can include additional regulatory sequences from the 3 '-untranslated region of plant 
genes, e.g., a 3' terminator region to increase mRNA stability of the mRNA, such as the PI-II 
10 terminator region of potato or the octopine or nopaline synthase 3' terminator regions. 

Additional Expression Elements 

Specific initiation signals can aid in efficient translation of coding sequences. 
These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where 
a coding sequence, its initiation codon and upstream sequences are inserted into the appropriate 

15 expression vector, no additional translational control signals may be needed. However, in cases 
where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is 
inserted, exogenous transcriptional control signals including the ATG initiation codon can be 
separately provided. The initiation codon is provided in the correct reading frame to facilitate 
transcription. Exogenous transcriptional elements and initiation codons can be of various origins, 

20 both natural and synthetic. The efficiency of expression can be enhanced by the inclusion of 
enhancers appropriate to the cell system in use. 

Expression Hosts 

The present invention also relates to host cells which are transduced with vectors 
of the invention, and the production of polypeptides of the invention (including fragments 

25 thereof) by recombinant techniques. Host cells are genetically engineered (i.e, nucleic acids are 
introduced, e.g., transduced, transformed or transfected) with the vectors of this invention, which 
may be, for example, a cloning vector or an expression vector comprising the relevant nucleic 
acids herein. The vector is optionally a plasmid, a viral particle, a phage, a naked nucleic acids, 
etc. The engineered host cells can be cultured in conventional nutrient media modified as 

30 appropriate for activating promoters, selecting transformants, or amplifying the relevant gene. 
The culture conditions, such as temperature, pH and the like, are those previously used with the 
host cell selected for expression, and will be apparent to those skilled in the art and in the 
references cited herein, including, Sambrook and Ausubel. 
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The host cell can be a eukaryotic cell, such as a yeast cell, or a plant cell, or the 
host cell can be a prokaryotic cell, such as a bacterial cell. Plant protoplasts are also suitable for 
some applications. For example, the DNA fragments are introduced into plant tissues, cultured 
plant cells or plant protoplasts by standard methods including electroporation (Fromm et al., 
5 (1985) Proc, Natl. Acad. Sci. USA 82, 5824, infection by viral vectors such as cauliflower mosaic 
virus (CaMV) (Hohn et al., (1982) Molecular Biology of Plant Tumors , (Academic Press, New 
York) pp. 549-560; US 4,407,956), high velocity ballistic penetration by small particles with the 
nucleic acid either within the matrix of small beads or particles, or on the surface (Klein et al., 
(1987) Nature 327, 70-73), use of pollen as vector (WO 85/01856), or use of Agrobacterium 

10 tumefaciens or A. rhizogenes carrying a T-DNA plasmid in which DNA fragments are cloned. 
The T-DNA plasmid is transmitted to plant cells upon infection by Agrobacterium tumefaciens, 
and a portion is stably integrated into the plant genome (Horsch et al. (1984) Science 233:496- 
498; Fraley et al. (1983) Proc. Natl. Acad. Sci. USA 80, 4803). 

The cell can include a nucleic acid of the invention which encodes a polypeptide, 

15 wherein the cells expresses a polypeptide of the invention. The cell can also include vector 

sequences, or the like. Furthermore, cells and transgenic plants which include any polypeptide or 
nucleic acid above or throughout this specification, e.g., produced by transduction of a vector of 
the invention, are an additional feature of the invention. 

For long-term, high-yield production of recombinant proteins, stable expression 

20 can be used. Host cells transformed with a nucleotide sequence encoding a polypeptide of the 

invention are optionally cultured under conditions suitable for the expression and recovery of the 
encoded protein from cell culture. The protein or fragment thereof produced by a recombinant 
cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 

25 containing polynucleotides encoding mature proteins of the invention can be designed with signal 
sequences which direct secretion of the mature polypeptides through a prokaryotic or eukaryotic 
cell membrane. 

Modified Amino Acids 

Polypeptides of the invention may contain one or more modified amino acids. 
30 The presence of modified amino acids may be advantageous in, for example, increasing 

polypeptide half-life, reducing polypeptide antigenicity or toxicity, increasing polypeptide storage 
stability, or the like. Amino acid(s) are modified, for example, co-translationally or post- 
translationally during recombinant production or modified by synthetic or chemical means. 
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Non-limiting examples of a modified amino acid include incorporation or other 
use of acetylated amino acids, glycosylated amino acids, sulfated amino acids, prenylated (e.g., 
famesylated, geranylgeranylated) amino acids, PEG modified (e.g., "PEGylated") amino acids, 
biotinylated amino acids, carboxylated amino acids, phosphorylated amino acids, etc. References 
5 adequate to guide one of skill in the modification of amino acids are replete throughout the 
literature. 

IDENTIFICATION OF ADDITIONAL FACTORS 

A transcription factor provided by the present invention can also be used to 
identify additional endogenous or exogenous molecules that can affect a phentoype or trait of 

10 interest. On the one hand, such molecules include organic (small or large molecules) and/or 
inorganic compounds that affect expression of (i.e., regulate) a particular transcription factor. 
Alternatively, such molecules include endogenous molecules that are acted upon either at a 
transcriptional level by a transcription factor of the invention to modify a phenotype as desired. 
For example, the transcription factors can be employed to identify one or more downstream gene 

1 5 with which is subject to a regulatory effect of the transcription factor. In one approach, a 

transcription factor or transcription factor homologue of the invention is expressed in a host cell, 
e.g, a transgenic plant cell, tissue or explant, and expression products, either RNA or protein, of 
likely or random targets are monitored, e.g., by hybridization to a microarray of nucleic acid 
probes corresponding to genes expressed in a tissue or cell type of interest, by two-dimensional 

20 gel electrophoresis of protein products, or by any other method known in the art for assessing 
expression of gene products at the level of RNA or protein. Alternatively, a transcription factor 
of the invention can be used to identify promoter sequences (i.e., binding sites) involved in the 
regulation of a downstream target. After identifying a promoter sequence, interactions between 
the transcription factor and the promoter sequence can be modified by changing specific 

25 nucleotides in the promoter sequence or specific amino acids in the transcription factor that 
interact with the promoter sequence to alter a plant trait. Typically, transcription factor DNA 
binding sites are identified by gel shift assays. After identifying the promoter regions, the 
promoter region sequences can be employed in double-stranded DNA arrays to identify 
molecules that affect the interactions of the transcription factors with their promoters (Bulyk et al. 

30 (1999) Nature Biotechnoloev 17:573-577). 

The identified transcription factors are also useful to identify proteins that modify 
the activity of the transcription factor. Such modification can occur by covalent modification, 
such as by phosphorylation, or by protein-protein (homo or-heteropolymer) interactions. Any 
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method suitable for detecting protein-protein interactions can be employed. Among the methods 
that can be employed are co-immunoprecipitation, cross-linking and co-purification through 
gradients or chromatographic columns, and the two-hybrid yeast system. 

The two-hybrid system detects protein interactions in vivo and is described in 
5 Chien, et al., (1991), Proc. Natl. Acad. Sci. USA 88, 9578-9582 and is commercially available 
from Clontech (Palo Alto, Calif.), hi such a system, plasmids are constructed that encode two 
hybrid proteins: one consists of the DNA-binding domain of a transcription activator protein 
fused to the TF polypeptide and the other consists of the transcription activator protein's 
activation domain fiised to an unknown protein that is encoded by a cDNA that has been 

10 recombined into the plasmid as part of a cDNA library. The DNA-binding domain fusion plasmid 
and the cDNA library are transformed into a strain of the yeast Saccharomyces cerevisiae that 
contains a reporter gene (e.g., lacZ) whose regulatory region contains the transcription activator's 
binding site. Either hybrid protein alone cannot activate transcription of the reporter gene, 
hiteraction of the two hybrid proteins reconstitutes the functional activator protein and results in 

1 5 expression of the reporter gene, which is detected by an assay for the reporter gene product. Then, 
the library plasmids responsible for reporter gene expression are isolated and sequenced to 
identify the proteins encoded by the library plasmids. After identifying proteins that interact with 
the transcription factors, assays for compounds that interfere with the TF protein-protein 
interactions can be preformed. 

20 IDENTIFICATION OF MODULATORS 

Li addition to the intracellular molecules described above, extracellular 
molecules that alter activity or expression of a transcription factor, either directly or indirectly, 
can be identified. For example, the methods can entail first placing a candidate molecule in 
contact with a plant or plant cell. The molecule can be introduced by topical administration, such 

25 as spraying or soaking of a plant, and then the molecule's effect on the expression or activity of 
the TF polypeptide or the expression of the polynucleotide monitored. Changes in the expression 
of the TF polypeptide can be monitored by use of polyclonal or monoclonal antibodies, gel 
electrophoresis or the like. Changes in the expression of the corresponding polynucleotide 
sequence can be detected by use of microarrays. Northerns, quantitative PCR, or any other 

30 technique for monitoring changes in mRNA expression. These techniques are exemplified in 
Ausubel et al. (eds) Current Protocols in Molecular Biologv , John Wiley & Sons (1998). Such 
changes in the expression levels can be correlated with modified plant traits and thus identified 
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molecules can be useful for soaking or spraying on fruit, vegetable and grain crops to modify 
traits in plants. 

Essentially any available composition can be tested for modulatory activity of 
expression or activity of any nucleic acid or polypeptide herein. Thus, available libraries of 
5 compounds such as chemicals, polypeptides, nucleic acids and the like can be tested for 

modulatory activity. Often, potential modulator compounds can be dissolved in aqueous or 
organic (e.g., DMSO-based) solutions for easy delivery to the cell or plant of interest in which the 
activity of the modulator is to be tested. Optionally, the assays are designed to screen large 
modulator composition libraries by automating the assay steps and providing compounds from 

10 any convenient source to assays, which are typically run in parallel (e.g., in microtiter formats on 
microtiter plates in robotic assays). 

In one embodiment, high throughput screening methods involve providing a 
combinatorial library containing a large number of potential compounds (potential modulator 
compounds). Such "combinatorial chemical libraries" are then screened in one or more assays, as 

15 described herein, to identify those library members (particular chemical species or subclasses) 
that display a desired characteristic activity. The compounds thus identified can serve as target 
compounds. 

A combinatorial chemical library can be, e.g., a collection of diverse chemical 
compounds generated by chemical synthesis or biological synthesis. For example, a 

20 combinatorial chemical library such as a polypeptide library is formed by combining a set of 
chemical building blocks (e.g., in one example,'amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound of a set length). 
Exemplary libraries include peptide libraries, nucleic acid libraries, antibody libraries (see, e.g., 
Vaughn et al. (1996) Nature Biotechnologv , 14(3):309-314 and PCT/US96/ 10287), carbohydrate 

25 libraries (see, e.g., Liang et al. Science (1996) 274: 1520-1522 and U.S. Patent 5,593,853), 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), and sniall organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&EN Jan 18, page 33 (1993); isoprenoids, U.S. 
Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent 5,549,974; pyrrolidines, U.S. 
Patents 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent 5,506,337) and the like. 

30 Preparation and screening of combinatorial or other libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. Res. 37:487-493 
(1991) and Houghton et al. Nature 354:84-88 (1991)). Other chemistries for generating chemical 
diversity libraries can also be used. 
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In addition, as noted, compound screening equipment for high-throughput 
screening is generally available, e.g., using any of a number of well known robotic systems that 
have also been developed for solution phase chemistries useful in assay systems. These systems 
include automated workstations including an automated synthesis apparatus and robotic systems 
5 utilizing robotic arms. Any of the above devices are suitable for use with the present invention, 
e.g., for high-throughput screening of potential modulators. The nature and implementation of 
modifications to these devices (if any) so that they can operate as discussed herein will be 
apparent to persons skilled in the relevant art. 

hideed, entire high throughput screening systems are commercially available. 

10 These systems typically automate entire procedures including all sample and reagent pipetting, 
liquid dispensing, timed incubations, and final readings of the microplate in detector(s) 
appropriate for the assay. These configurable systems provide high throughput and rapid start up 
as well as a high degree of flexibility and customization. Similarly, microfluidic implementations 
of screening are also commercially available. 

15 The manufacturers of such systems provide detailed protocols the various high 

throughput. Thus, for example, Zymark Corp. provides technical bulletins describing screening 
systems for detecting the modulation of gene transcription, ligand binding, and the like. The 
integrated systems herein, in addition to providing for sequence alignment and, optionally, 
synthesis of relevant nucleic acids, can include such screening apparatus to identify modulators 

20 that have an effect on one or more polynucleotides or polypeptides according to the present 
invention. 

In some assays it is desirable to have positive controls to ensure that the 
components of the assays are working properly. At least two types of positive controls are 
appropriate. That is, known transcriptional activators or inhibitors can be incubated with 

25 cells/plants/ etc, in one sample of the assay, and the resulting increase/decrease in transcription 

can be detected by measuring the resulting increase in RNA/ protein expression, etc., according to 
the methods herein. It will be appreciated that modulators can also be combined with 
transcriptional activators or inhibitors to find modulators which inhibit transcriptional activation 
or transcriptional repression. Either expression of the nucleic acids and proteins herein or any 

30 additional nucleic acids or proteins activated by the nucleic acids or proteins herein, or both, can 
be monitored. 

In an embodiment, the invention provides a method for identifying compositions 
that modulate the activity or expression of a polynucleotide or polypeptide of the invention. For 
example, a test compound, whether a small or large molecule, is placed in contact with a cell, 
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plant (or plant tissue or explant), or composition comprising the polynucleotide or polypeptide of 
interest and a resulting effect on the cell, plant, (or tissue or explant) or composition is evaluated 
by monitoring, either directly or indirectly, one or more of: expression level of the polynucleotide 
or polypeptide, activity (or modulation of the activity) of the polynucleotide or polypeptide. In 
some cases, an alteration in a plant phenotype can be detected following contact of a plant (or 
plant cell, or tissue or explant) with the putative modulator, e.g., by modulation of expression or 
activity of a polynucleotide or polypeptide of the invention. 

SUBSEQUENCES 

Also contemplated are uses of polynucleotides, also referred to herein as 

oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably at least 

20, 30, or 50 bases, which hybridize under at least highly stringent (or ultra-high stringent or 

ultra-ultra- high stringent conditions) conditions to a polynucleotide sequence described above. 

The polynucleotides may be used as probes, primers, sense and antisense agents, and the like, 

according to methods as noted supra. 

Subsequences of the polynucleotides of the invention, including polynucleotide 
fragments and oligonucleotides are useful as nucleic acid probes and primers. An oligonucleotide 
suitable for use as a probe or primer is at least about 15 nucleotides in length, more often at least 
about 18 nucleotides, often at least about 21 nucleotides, frequently at least about 30 nucleotides, 
or about 40 nucleotides, or more in length. A nucleic acid probe is useful in hybridization 
protocols, e.g., to identify additional polypeptide homologues of the invention, including 
protocols for microarray experiments. Primers can be annealed to a complementary target DNA 
strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA 
strand, and then extended along the target DNA strand by a DNA polymerase enzyme. Primer 
pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain 
reaction (PCR) or other nucleic-acid amplification methods. See Sambrook and Ausubel, supra. 

In addition, the invention includes an isolated or recombinant polypeptide 
including a subsequence of at least about 15 contiguous amino acids encoded by the recombinant 
or isolated polynucleotides of the invention. For example, such polypeptides, or domains or 
fragments thereof, can be used as immunogens, e.g., to produce antibodies specific for the 
polypeptide sequence, or as probes for detecting a sequence of interest. A subsequence can range 
in size from about 15 amino acids in length up to and including the fiill length of the polypeptide. 
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PRODUCTION OF TRANSGENIC PLANTS 
Modification of Traits 

The polynucleotides of the invention are favorably employed to produce 
transgenic plants with various traits, or characteristics, that have been modified in a desirable 
5 manner, e.g., to improve the seed characteristics of a plant. For example, alteration of expression 
levels or patterns (e.g., spatial or temporal expression patterns) of one or more of the transcription 
factors (or transcription factor homologues) of the invention, as compared with the levels of the 
same protein found in a wild type plant, can be used to modify a plant's traits. An illustrative 
example of trait modification, modified structure and development characteristics, by altering 
10 expression levels of a particular transcription factor is described further in the Examples and the 
Sequence Listing. 

Antisense and Cosuppression Approaches 

In addition to expression of the nucleic acids of the invention as gene 
replacement or plant phenotype modification nucleic acids, the nucleic acids are also useful for 

15 sense and anti-sense suppression of expression, e.g., to down-regulate expression of a nucleic 
acid of the invention, e.g., as a further mechanism for modulating plant phenotype. That is, the 
nucleic acids of the invention, or subsequences or anti-sense sequences thereof, can be used to 
block expression of naturally occurring homologous nucleic acids. A variety of sense and anti- 
sense technologies are known in the art, e.g., as set forth in Lichtenstein and Nellen (1997) 

20 Antisense Technologv: A Practical Approach IRL Press at Oxford University, Oxford, England. 
In general, sense or anti-sense sequences are introduced into a cell, where they are optionally 
amplified, e.g., by transcription. Such sequences include both simple oligonucleotide sequences 
and catalytic sequences such as ribozymes. 

For example, a reduction or elimination of expression (i.e., a "knock-out") of a 

25 transcription factor or transcription factor homologue polypeptide in a transgenic plant, e.g., to 
modify a plant trait, can be obtained by introducing an antisense construct corresponding to the 
polypeptide of interest as a cDNA. For antisense suppression, the transcription factor or homologue 
cDNA is arranged in reverse orientation (with respect to the coding sequence) relative to the 
promoter sequence in the expression vector. The introduced sequence need not be the full length 

30 cDNA or gene, and need not be identical to the cDNA or gene found in the plant type to be 

transformed. Typically, the antisense sequence need only be capable of hybridizing to the target 
gene or RNA of interest. Thus, where the introduced sequence is of shorter length, a higher 
degree of homology to the endogenous transcription factor sequence will be needed for effective 
antisense suppression. While antisense sequences of various lengths can be utilized, preferably. 
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the introduced antisense sequence in the vector will be at least 30 nucleotides in length, and 
improved antisense suppression will typically be observed as the length of the antisense sequence 
increases. Preferably, the length of the antisense sequence in the vector will be greater than 100 
nucleotides. Transcription of an antisense construct as described results in the production of 
5 RNA molecules that are the reverse complement of mRNA molecules transcribed from the 
endogenous transcription factor gene in the plant cell. 

Suppression of endogenous transcription factor gene expression can also be 
achieved using a ribozyme. Ribozymes are RNA molecules that possess highly specific 
endoribonuc lease activity. The production and use of ribozymes are disclosed in U.S. Patent No. 

10 4,987,071 and U.S. Patent No. 5,543,508. Synthetic ribozyme sequences including antisense 

RNAs can be used to confer RNA cleaving activity on the antisense RNA, such that endogenous 
mRNA molecules that hybridize to the antisense RNA are cleaved, which in turn leads to an 
enhanced antisense inhibition of endogenous gene expression. 

Vectors in which RNA encoded by a transcription factor or transcription factor 

15 homologue cDNA is over-expressed can also be used to obtain co-suppression of a corresponding 
endogenous gene, e.g., in the manner described in U.S. Patent No, 5,23 1,020 to Jorgensen. Such 
co-suppression (also termed sense suppression) does not require that the entire transcription factor 
cDNA be introduced into the plant cells, nor does it require that the introduced sequence be 
exactly identical to the endogenous transcription factor gene of interest. However, as with 

20 antisense suppression, the suppressive efficiency will be enhanced as specificity of hybridization 
is increased, e.g., as the introduced sequence is lengthened, and/or as the sequence similarity 
between the introduced sequence and the endogenous transcription factor gene is increased. 

Vectors expressing an untranslatable form of the transcription factor mRNA, e.g., 
sequences comprising one or more stop codon, or nonsense mutation) can also be used to 

25 suppress expression of an endogenous transcription factor, thereby reducing or eliminating it's 
activity and modifying one or more traits. Methods for producing such constructs are described 
in U.S. Patent No. 5,583,021. Preferably, such constructs are made by introducing a premature 
stop codon into the transcription factor gene. Alternatively, a plant trait can be modified by gene 
silencing using double-strand RNA (Sharp (1999) Genes and Development 13: 139-141). 

30 Another method for abolishing the expression of a gene is by insertion 

mutagenesis using the T-DNA of Agrobacterium tumefaciens. After generating the insertion 
mutants, the mutants can be screened to identify those containing the insertion in a transcription 
factor or transcription factor homologue gene. Plants containing a single transgene insertion 
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event at the desired gene can be crossed to generate homozygous plants for the mutation (Koncz 
et al. (1992) Methods in Arabidopsis Research, World Scientific). 

Alternatively, a plant phenotype can be altered by eliminating an endogenous 
gene, such as a transcription factor or transcription factor homologue, e.g., by homologous 
5 recombination (Kempin et al. (1997) Nature 389:802). 

A plant trait can also be modified by using the cre-lox system (for example, as 
described in US Pat. No. 5,658,772). A plant genome can be modified to include first and 
second lox sites that are then contacted with a Cre recombinase. If the lox sites are in the same 
orientation, the intervening DNA sequence between the two sites is excised. If the lox sites are in 

10 the opposite orientation, the intervening sequence is inverted. 

The pol3aiucleotides and polypeptides of this invention can also be expressed in a 
plant in the absence of an expression cassette by manipulating the activity or expression level of 
the endogenous gene by other means. For example, by ectopically expressing a gene by T-DNA 
activation tagging (Ichikawa et al. (1997) Nature 390 698-701; Kakimoto et al. (1996) Science 

15 274: 982-985). This method entails transforming a plant with a gene tag containing multiple 

transcriptional enhancers and once the tag has inserted into the genome, expression of a flanking 
gene coding sequence becomes deregulated. In another example, the transcriptional machinery in 
a plant can be modified so as to increase transcription levels of a polynucleotide of the invention 
{See, e.g., PCT Publications WO 96/06166 and WO 98/53057 which describe the modification of 

20 the DNA binding specificity of zinc finger proteins by changing particular amino acids in the 
DNA binding motif). 

The transgenic plant can also include the machinery necessary for expressing or 
altering the activity of a polypeptide encoded by an endogenous gene, for example by altering the 
phosphorylation state of the polypeptide to maintain it in an activated state. 

25 Transgenic plants (or plant cells, or plant explants, or plant tissues) incorporating 

the polynucleotides of the invention and/or expressing the polypeptides of the invention can be 
produced by a variety of well established techniques as described above. Following construction 
of a vector, most typically an expression cassette, including a polynucleotide, e.g., encoding a 
transcription factor or transcription factor homologue, of the invention, standard techniques can 

30 be used to introduce the polynucleotide into a plant, a plant cell, a plant explant or a plant tissue 
of interest. Optionally, the plant cell, explant or tissue can be regenerated to produce a transgenic 
plant. 

The plant can be any higher plant, including gymnosperms, monocotyledonous 
and dicotyledenous plants. Suitable protocols are available for Leguminosae (alfalfa, soybean, 
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clover, etc.), Umbelliferae (carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, 
broccoli, etc.), Curcurbitaceae (melons and cucumber), Gramineae (wheat, com, rice, barley, 
millet, etc.), Solanaceae (potato, tomato, tobacco, peppers, etc.), and various other crops. See 
protocols described in Ammirato et al. (1984) Handbook of Plant Cell Culture -Crop Species . 
5 Macmillan Publ. Co. Shimamoto et al, (1989) Nature 338:274-276: Fromm et al. (1990) 
Bio/Technology 8:833-839; and Vasil et al. (1990) Bio/Technology 8:429-434. 

Transformation and regeneration of both monocotyledonous and dicotyledonous 
plant cells is now routine, and the selection of the most appropriate transformation technique will 
be determined by the practitioner. The choice of method will vary with the type of plant to be 

10 transformed; those skilled in the art will recognize the suitability of particular methods for given 
plant types. Suitable methods can include, but are not limited to: electroporation of plant 
protoplasts; liposome-mediated transformation; polyethylene glycol (PEG) mediated 
transformation; transformation using viruses; micro-injection of plant cells; micro-projectile 
bombardment of plant cells; vacuum infiltration; and Agrobacterium tumeficiens mediated 

15 transformation. Transformation means introducing a nucleotide sequence in a plant in a manner to 
cause stable or transient expression of the sequence. 

Successful examples of the modification of plant characteristics by 
transformation with cloned sequences which serve to illustrate the current knowledge in this field 
of technology, and which are herein incorporated by reference, include: U.S. Patent Nos. 

20 5,571,706; 5,677,175; 5,510,471; 5,750,386; 5,597,945; 5,589,615; 5,750,871; 5,268,526; 
5,780,708; 5,538,880; 5,773,269; 5,736,369 and 5,610,042. 

Following transformation, plants are preferably selected using a dominant 
selectable marker incorporated into the transformation vector. Typically, such a marker will 
confer antibiotic or herbicide resistance on the transformed plants, and selection of transformants 

25 can be accomplished by exposing the plants to appropriate concentrations of the antibiotic or 
herbicide. 

After transformed plants are selected and grown to maturity, those plants 
showing a modified trait are identified. The modified trait can be any of those traits described 
above. Additionally, to confirm that the modified trait is due to changes in expression levels or 
30 activity of the polypeptide or polynucleotide of the invention can be determined by analyzing 
mRNA expression using Northern blots, RT-PCR or microarrays, or protein expression using 
immunoblots or Western blots or gel shift assays. 
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INTEGRATED SYSTEMS— SEQUENCE IDENTITY 

Additionally, the present invention may be an integrated system, computer or 
computer readable medium that comprises an instruction set for determining the identity of one or 
more sequences in a database. In addition, the instruction set can be used to generate or identify 
5 sequences that meet any specified criteria. Furthermore, the instruction set may be used to 
associate or link certain functional benefits, such modified structure and development 
characteristics, with one or more identified sequence. 

For example, the instruction set can include, e.g., a sequence comparison or other 
alignment program, e.g., an available program such as, for example, the Wisconsin Package 

10 Version 10.0, such as BLAST, FASTA, PILEUP, FINDPATTERNS or the like (GCG, Madision, 
WI). Public sequence databases such as GenBank, EMBL, Swiss-Prot and PIR or private 
sequence databases such as PhytoSeq (Incyte Pharmaceuticals, Palo Alto, CA) can be searched. 

Alignment of sequences for comparison can be conducted by the local homology 
algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, by the homology alignment 

15 algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A . 85: 2444, by computerized 
implementations of these algorithms. After alignment, sequence comparisons between two (or 
more) polynucleotides or polypeptides are typically performed by comparing sequences of the 
two sequences over a comparison window to identify and compare local regions of sequence 

20 similarity. The comparison window can be a segment of at least about 20 contiguous positions, 
usually about 50 to about 200, more usually about 100 to about 150 contiguous positions. A 
description of the method is provided in Ausubel et al., supra. 

A variety of methods of determining sequence relationships can be used, 
including manual alignment and computer assisted sequence alignment and analysis. This later 

25 approach is a preferred approach in the present invention, due to the increased throughput 

afforded by computer assisted methods. As noted above, a variety of computer programs for 
performing sequence alignment are available, or can be produced by one of skill. 

One example algorithm that is suitable for determining percent sequence identity 
and sequence similarity is the BLAST algorithm, which is described in Altschul et al. J. Mol. Biol 

30 215:403-410 (1990). Software for performing BLAST analyses is publicly available, e.g., 

through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
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referred to as the neighborhood word score threshold (Altschul et al, supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. 
The word hits are then extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, for 
5 nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 
0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a 
scoring matrix is used to calculate the cumulative score. Extension of the word hits in each 
direction are halted when: the cumulative alignment score falls off by the quantity X from its 
maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of 

10 one or more negative-scoring residue alignments; or the end of either sequence is reached. The 
BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. 
The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an 
expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino 
acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) 

15 of 10, and the BLOSUM62 scoring matrix {see Henikoff & Henikoff (1 989 ) Proc. Natl. Acad. 
Sci. USA 89:10915). 

In addition to calculating percent sequence identity, the BLAST algorithm also 
performs a statistical analysis of the similarity between two sequences {see, e.g., Karl in & 
Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787). One measure of similarity provided 

20 by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of 
the probability by which a match between two nucleotide or amino acid sequences would occur 
by chance. For example, a nucleic acid is considered similar to a reference sequence (and, 
therefore, in this context, homologous) if the smallest sum probability in a comparison of the test 
nucleic acid to the reference nucleic acid is less than about 0.1, or less than about 0.01, and or 

25 even less than about 0.001. An additional example of a useful sequence alignment algorithm is 

PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using 
progressive, pairwise alignments. The program can align, e.g., up to 300 sequences of a 
maximum length of 5,000 letters. 

The integrated system, or computer typically includes a user input interface 

30 allowing a user to selectively view one or more sequence records corresponding to the one or 

more character strings, as well as an instruction set which aligns the one or more character strings 
with each other or with an additional character string to identify one or more region of sequence 
similarity. The system may include a link of one or more character strings with a particular 
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phenotype or gene fixnction. Typically, the system includes a user readable output element which 
displays an alignment produced by the alignment instruction set. 

The methods of this invention can be implemented in a localized or distributed 
computing environment. In a distributed environment, the methods may implemented on a single 
5 computer comprising multiple processors or on a multiplicity of computers. The computers can 
be linked, e.g. through a common bus, but more preferably the computer(s) are nodes on a 
network. The network can be a generalized or a dedicated local or wide-area network and, in 
certain preferred embodiments, the computers may be components of an intra-net or an internet. 

Thus, the invention provides methods for identifying a sequence similar or 

10 homologous to one or more polynucleotides as noted herein, or one or more target polypeptides 
encoded by the polynucleotides, or otherwise noted herein and may include linking or associating 
a given plant phenotype or gene function with a sequence. Li the methods, a sequence database is 
provided (locally or across an inter or intra net) and a query is made against the sequence 
database using the relevant sequences herein and associated plant phenotypes or gene functions. 

15 Any sequence herein can be entered into the database, before or after querying 

the database. This provides for both expansion of the database and, if done before the querying 
step, for insertion of control sequences into the database. The control sequences can be detected 
by the query to ensure the general integrity of both the database and the query. As noted, the 
query can be performed using a web browser based interface. For example, the database can be a 

20 centralized public database such as those noted herein, and the querying can be done from a 
remote terminal or computer across an internet or intranet. 

EXAMPLES 

The following examples are intended to illustrate but not limit the present 

invention. 

25 EXAMPLE I. FULL LENGTH GENE IDENTIFICATION AND CLONING 

Putative transcription factor sequences (genomic or ESTs) related to known 
transcription factors were identified in the Arabidopsis thaliana GenBank database using the 
tblastn sequence analysis program using default parameters and a P-value cutoff threshold of -4 
or —5 or lower, depending on the length of the query sequence. Putative transcription factor 

30 sequence hits were then screened to identify those containing particular sequence strings. If the 
sequence hits contained such sequence strings, the sequences were confirmed as transcription 
factors. 
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Alternatively, Arabidopsis thaliana cDNA libraries derived from different tissues 
or treatments, or genomic libraries were screened to identify novel members of a transcription 
family using a low stringency hybridization approach. Probes were synthesized using gene 
specific primers in a standard PCR reaction (annealing temperature 60° C) and labeled with ^^P 
5 dCTP using the High Prime DNA Labeling Kit (Boehringer Mannheim). Purified radiolabeled 
probes were added to filters immersed in Church hybridization medium (0.5 M NaP04 pH 7.0, 
7% SDS, 1 % w/v bovine serum albumin) and hybridized overnight at 60 °C with shaking. Filters 
were washed two times for 45 to 60 minutes with IxSCC, 1% SDS at 60"" C. 

To identify additional sequence 5' or 3' of a partial cDNA sequence in a cDNA 

10 library, 5' and 3' rapid amplification of cDNA ends (RACE) was performed using the Marathon™ 
cDNA amplification kit (Clontech, Palo Alto, CA). Generally, the method entailed first isolating 
poly(A) mRNA, performing first and second strand cDNA synthesis to generate double stranded 
cDNA, blunting cDNA ends, followed by ligation of the Marathon™ Adaptor to the cDNA to 
form a library of adaptor-ligated ds cDNA. 

15 Gene-specific primers were designed to be used along with adaptor specific 

primers for both 5' and 3' RACE reactions. Nested primers, rather than single primers, were used 
to increase PCR specificity. Using 5' and 3' RACE reactions, 5' and 3' RACE fragments were 
obtained, sequenced and cloned. The process can be repeated until 5' and 3' ends of the full- 
length gene were identified. Then the full-length cDNA was generated by PCR using primers 

20 specific to 5 ' and 3 ' ends of the gene by end-to-end PCR. 

EXAMPLE II. CONSTRUCTION OF EXPRESSION VECTORS 

The sequence was amplified from a genomic or cDNA library using primers 
specific to sequences upstream and downstream of the coding region. The expression vector was 
pMEN20 or pMEN65, which are both derived from pMON3 16 (Sanders et al, (1987 ) Nucleic 

25 Acids Research 15:1543-58) and contain the CaMV 35S promoter to express transgenes. To 

clone the sequence into the vector, both pMEN20 and the amplified DNA fragment were digested 
separately with Sail and NotI restriction enzymes at 37° C for 2 hours. The digestion products 
were subject to electrophoresis in a 0.8% agarose gel and visualized by ethidium bromide 
staining. The DNA fragments containing the sequence and the linearized plasmid were excised 

30 and purified by using a Qiaquick gel extraction kit (Qiagen, CA). The fragments of interest were 
ligated at a ratio of 3 : 1 (vector to insert). Ligation reactions using T4 DNA ligase (New England 
Biolabs, MA) were carried out at 16° C for 16 hours. The ligated DNAs were transformed into 
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competent cells of the coli strain DHSalpha by using the heat shock method. The 
transformations were plated on LB plates containing 50 mg/I kanamycin (Sigma). 

Individual colonies were grown overnight in five milliliters of LB broth 
containing 50 mg/1 kanamycin at 37** C. Plasmid DNA was purified by using Qiaquick Mini 
5 Prep kits (Qiagen, CA). 

EXAMPLE III. TRANSFORMATION OF AGROBACTERIUM WITH THE EXPRESSION 
VECTOR 

After the plasmid vector containing the gene was constructed, the vector was 
used to transform Agrobacterium tumefaciens cells expressing the gene products. The stock of 

10 Agrobacterium tumefaciens cells for transformation were made as described by Nagel et al. 

(1990) FEMS Microbiol Letts . 67: 325-328. Agrobacterium strain ABI was grown in 250 ml LB 
medium (Sigma) overnight at 28**C with shaking until an absorbance (Aeoo) of 0.5 - 1.0 was 
reached. Cells were harvested by centrifugation at 4,000 x g for 15 min at 4° C. Cells were then 
resuspended in 250 jal chilled buffer (1 mM HEPES, pH adjusted to 7.0 with KOH). Cells were 

15 centrifuged again as described above and resuspended in 125 \i\ chilled buffer. Cells were then 
centrifuged and resuspended two more times in the same HEPES buffer as described above at a 
volume of 100 ^1 and 750 \x\, respectively. Resuspended cells were then distributed into 40 \x\ 
aliquots, quickly frozen in liquid nitrogen, and stored at -80° C. 

Agrobacterium cells were transformed with plasmids prepared as described 

20 above following the protocol described by Nagel et al. For each DNA construct to be 

transformed, 50 - 100 ng DNA (generally resuspended in 10 mM Tris-HCl, 1 mM EDTA, pH 
8.0) was mixed with 40 fxl of Agrobacterium cells. The DNA/cell mixture was then transferred to 
a chilled cuvette with a 2mm electrode gap and subject to a 2.5 kV charge dissipated at 25 |iF and 
200 i^F using a Gene Pulser 11 apparatus (Bio-Rad). After electroporation, cells were 

25 immediately resuspended in 1 .0 ml LB and allowed to recover without antibiotic selection for 2 - 
4 hours at 28° C in a shaking incubator. After recovery, cells were plated onto selective medium 
of LB broth containing 100 |ig/ml spectinomycin (Sigma) and incubated for 24-48 hours at 28° C. 
Single colonies were then picked and inoculated in fresh medium. The presence of the plasmid 
construct was verified by PCR amplification and sequence analysis. 

30 EXAMPLE IV. TRANSFORMATION OF ARABIDOPSIS PLANTS WITH AGROBACTERIUM 
TUMEFACIENS WITH EXPRESSION VECTOR 

After transformation of Agrobacterium tumefaciens with plasmid vectors 

containing the gene, single Agrobacterium colonies were identified, propagated, and used to 
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tronsform Arabidopsis plants. Briefly, 500 ml cultures of LB medium containing 50 mg/1 
kanamycin were inoculated with the colonies and grown at 28° C with shaking for 2 days until an 
absorbance (Aeoo) of > 2.0 is reached. Cells were then harvested by centrifugation at 4,000 x g 
for 10 min, and resuspended in infiltration medium (1/2 X Murashige and Skoog salts (Sigma), 1 
5 X Gamborg's B-5 vitamins (Sigma), 5.0% (w/v) sucrose (Sigma), 0.044 ^iM benzylamino purine 
(Sigma), 200 |il/L Silwet L-77 (Lehle Seeds) until an absorbance (Aeoo) of 0.8 was reached. 

Prior to transformation, Arabidopsis thaliana seeds (ecotype Columbia) were 
sown at a density of ~10 plants per 4" pot onto Pro-Mix BX potting medium (Hummert 
International) covered with fiberglass mesh (18 mm X 16 mm). Plants were grown under 

10 continuous illumination (50-75 ^E/mVsec) at 22-23° C with 65-70% relative humidity. After 
about 4 weeks, primary inflorescence stems (bolts) are cut off to encourage growth of multiple 
secondary bolts. After flowering of the mature secondary bolts, plants were prepared for 
transformation by removal of all siliques and opened flowers. 

The pots were then immersed upside down in the mixture of Agrobacterium 

15 infiltration medium as described above for 30 sec, and placed on their sides to allow draining into 
a r x 2' flat surface covered with plastic wrap. After 24 h, the plastic wrap was removed and 
pots are turned upright. The immersion procedure was repeated one week later, for a total of two 
immersions per pot. Seeds were then collected from each transformation pot and analyzed 
following the protocol described below. 

20 EXAMPLE V. IDENTIFICATION OF ARABIDOPSIS PRIMARY TRANSFORMANTS 
Seeds collected from the transformation pots were sterilized essentially as 
follows. Seeds were dispersed into in a solution containing 0. 1% (v/v) Triton X-100 (Sigma) and 
sterile H2O and washed by shaking the suspension for 20 min. The wash solution was then 
drained and replaced with fresh wash solution to wash the seeds for 20 min with shaking. After 

25 removal of the second wash solution, a solution containing 0. 1% (v/v) Triton X-100 and 70% 
ethanol (Equistar) was added to the seeds and the suspension was shaken for 5 min. After 
removal of the ethanol/detergent solution, a solution containing 0.1% (v/v) Triton X-100 and 30% 
(v/v) bleach (Clorox) was added to the seeds, and the suspension was shaken for 10 min. After 
removal of the bleach/detergent solution, seeds were then washed five times in sterile distilled 

30 H2O. The seeds were stored in the last wash water at 4° C for 2 days in the dark before being 

plated onto antibiotic selection mediimi (IX Murashige and Skoog salts (pH adjusted to 5.7 with 
IM KOH), 1 X Gamborg's B-5 vitamins, 0.9% phytagar (Life Technologies), and 50 mg/1 
kanamycin). Seeds were germinated under continuous illumination (50-75 jixE/m^/sec) at 22-23° 
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C. After 7-10 days of growth under these conditions, kanamycin resistant primary transformants 
(Ti generation) were visible and obtained. These seedlings were transferred first to fresh 
selection plates where the seedlings continued to grow for 3-5 more days, and then to soil (Pro- 
Mix BX potting medium). 
5 Primary transformants were crossed and progeny seeds (T2) collected; kanamycin 

resistant seedlings were selected and analyzed. The expression levels of the recombinant 
polynucleotides in the transformants varies from about a 5% expression level increase to a least a 
100% expression level increase. Similar observations are made with respect to polypeptide level 
expression. 

10 

EXAMPLE VI. IDENTIFICATION OF ARABIDOPSIS PLANTS WITH TRANSCRIPTION 
FACTOR GENE KNOCKOUTS 

The screening of insertion mutSLgenized Arabidopsis collections for null mutants 

in a known target gene was essentially as described in Krysan et al (1999) Plant Cell 1 1 :2283- 

1 5 2290. Briefly, gene-specific primers, nested by 5-250 pb to each others, were designed from the 
5' and 3' regions of a known target gene. Similarly, nested sets of primers were also created 
specific to each of the T-DNA or transposon ends (the "right" and "left" borders). All possible 
combinations of gene specific and T-DNA/transposon primers were used to detect by PCR an 
insertion event within or close to the target gene. The amplified DNA fragments were then 

20 sequenced which allows the precise determination of the T-DNA/transposon insertion point 
relative to the target gene. Insertion events within the coding or intervening sequence of the 
genes were deconvoluted from a pool comprising a plurality of insertion events to a single unique 
mutant plant for functional characterization. The method is described in more detail in Yu and 
Adam, US Application Serial No. 09/177,733 filed October 23, 1998. 

25 EXAMPLE VII. IDENTIFICATION OF STRUCTURE AND DEVELOPMENT 

CHARACTERISTICS PHENOTYPE IN OVEREXPRESSOR OR GENE KNOCKOUT 
PLANTS 

Experiments were performed to identify those transformants or knockouts that 
exhibited a modified structure and development characteristics. For such studies, the 
30 transformants were observed by eye to identify novel structural or developmental characteristics 
associated with the ectopic expression of the polynucleotides or polypeptides of the invention. 

Table 3 shows the phenotypes observed for particular overexpressor or knockout 
plants and provides the SEQ ED No., the internal reference code (GID), whether a knockout or 
overexpressor plant was analyzed and the observed phenotype. 
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Table 3 



SEQ ID No. 


GID 


Knockout (KO) or 
overexpressor (KO) 


Phenotype observed 


1 
1 




OE 


Plants were small and more dark creen in 

color, late flowering and poorly fertile. 


3 


G732 


OE 


Plants were small and inflorescence was 
unelongated. Flowers parts appeared to be un- 
elongated and the plants were semi-sterile. 


5 


G9 


OE 


Increased root mass 


7 


G428 


OE 


Lobed and highly serrated leaves and 
abnormal first and second whorl floral organs 


9 


G869 


OE 


Undeveloped or small anthers 


11 


G1269 


OE 


Extended petioles and leaves pointed upwards 


13 


G1038 


OE 


Altered leaf shape 


15 


G438 


KO 


Reduced lignin in stem 


1 n 
1 / 


LjD / 1 


jSXJ 


i-ieiayeo senescence ai ine enu oi me piani 
lifecycle 


19 


G748 


OE 


More vascular bundles in stem 


21 


G431 


OE 


Severe developmental abnormalities such as 
altered branching, twisted rosette leaves, 
flowers with missing pistils, fused stamens and 
atypical numbers of petals and stamens, 
reoucea scconciary doiis, ano. lacK, ui vauiine 
leaves. 


23 


G187 


OE 


Plants had long, thin cotyledons and reduced 
apical dominance. Several flower 
aonormauLies, mciuaing unucrucvciupcu, 
seoaloid netals and underdeveloned anthers 
were also observed. 


25 


G470 


OE 


Plants were sterile due to failure of anthers to 
elongate 


27 


G615 


OE 


Plants were sterile due to failure of anthers to 
develop and failure of stamens to elongate. 
Fused cotyledons and absence of a shoot apical 
meristem and true leaves was also observed. 


29 


G1073 


OE 


Increased plant size and serrated leaves 



For a particular overexpressor that shows a less beneficial structure and development 
5 characteristic, it may be more useful to select a plant with a decreased expression of the particular 
transcription factor. For a particular knockout that shows a less beneficial structure and 
development characteristic, it may be more useful to select a plant with an increased expression 
of the particular transcription factor. 
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EXAMPLE VIII. IDENTIFICATION OF HOMOLOGOUS SEQUENCES 

Homologous sequences from Arabidopsis and plant species other than Arabidopsis were 
identified using database sequence search tools, such as the Basic Local Alignment Search Tool 
(BLAST) (Altschul et al. (1990) J. Mol. Biol. 215:403-410; and Altschul et al. (1997) Nucl. Acid 
5 Res. 25: 3389-3402). The tblastx sequence analysis programs were employed using the 

BLOSUM-62 scoring matrix (Henikoff, S. and Henikoff, J. G. (1992) Proc. Natl. Acad. Sci. USA 
89: 10915-10919). 

Identified Arabidopsis homologous sequences are provided in Figure 2 and included in 
the Sequence Listing. The percent sequence identity among these sequences is as low as 47% 

10 sequence identity. Additionally, the entire NCBI GenBank database was filtered for sequences 
from all plants except Arabidopsis thaliana by selecting all entries in the NCBI GenBank 
database associated with NCBI taxonomic ED 33090 (Viridiplantae; all plants) and excluding 
entries associated with taxonomic ID 3701 {Arabidopsis thaliana). These sequences were 
compared to sequences representing genes of SEQ IDs Nos. 1-54 on 9/26/2000 using the 

15 Washington University TBLASTX algorithm (version 2.0al9MP). For each gene of SEQ IDs 

Nos. 1-54, individual comparisons were ordered by probability score (P-value), where the score 
reflects the probability that a particular alignment occurred by chance. For example, a score of 
3.6e-40 is 3.6 X 10"*^. For up to ten species, the gene with the lowest P-value (and therefore the 
most likely homolog) is listed in Figure 3. 

20 In addition to P-values, comparisons were also scored by percentage identity. Percentage 

identity reflects the degree to which two segments of DNA or protein are identical over a 
particular length. The ranges of percent identity between the non- Arabidopsis genes shown in 
Figure 3 and the Arabidopsis genes in the sequence listing are: SEQ ID No. 1: 36%-69%; SEQ ID 
No, 3: 46%-54%; SEQ ID No. 5: 57%-72%; SEQ ID No. 7: 54%-69%; SEQ ID No. 9: 31%-68%; 

25 SEQ ID No. 1 1: 47%-90%; SEQ ID No. 13: 34%-82%; SEQ ID No. 15: 49%-88%; SEQ ID No. 
17: 56%-67%; SEQ ID No. 19: 39%-61%; SEQ ID No. 21: 61%-87%; SEQ ID No. 23: 38%- 
85%; SEQ ID No. 25: 44%-94%; SEQ ID No. 27: 35%-44%; SEQ ID No. 29: 37%-71%; SEQ ID 
No. 31: 38%-77%; SEQ ID No. 33: 57%-69%; SEQ ID No. 35: 54%-69%; SEQ ID No. 37: 60%- 
75%; SEQ ID No. 39: 47%-65%; SEQ ID No. 41 : 60%-88%; SEQ ID No. 43: 43%-87%; and 

30 SEQ ID No. 45: 53%-97%, 

The polynucleotides and polypeptides in the Sequence Listing and the identified 
homologous sequences may be stored in a computer system and have associated or linked with 
the sequences a function, such as that the polynucleotides and polypeptides are useful for 
modifying the structure and development characteristics of a plant. 
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All references, publications, patents and other documents herein are incorporated by 
reference in their entirety for all purposes. Although the invention has been described with 
reference to the embodiments and examples above, it should be understood that various 
5 modifications can be made without departing from the spirit of the invention. 
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What is claimed is: 

1 . A transgenic plant with modified structure and development characteristics, which plant 
comprises a recombinant polynucleotide comprising a nucleotide sequence selected from the 
group consisting of: 

5 (a) a nucleotide sequence encoding a polypeptide comprising a sequence selected from 

SEQ ID Nos. 2N, where N=l-23, or a complementary nucleotide sequence thereof; 

(b) a nucleotide sequence encoding a polypeptide comprising a conservatively substituted 
variant of a polypeptide of (a); 

(c) a nucleotide sequence comprising a sequence selected from those of SEQ ID Nos. 2N- 
10 1, where N=l-23, or a complementary nucleotide sequence thereof; 

(d) a nucleotide sequence comprising silent substitutions in a nucleotide sequence of (c); 

(e) a nucleotide sequence which hybridizes under stringent conditions to a nucleotide 
sequence of one or more of: (a), (b), (c), or (d); 

(f) a nucleotide sequence comprising at least 15 consecutive nucleotides of a sequence of 
15 any of (a)-(e); 

(g) a nucleotide sequence comprising a subsequence or fragment of any of (a)-(f), which 
subsequence or fragment encodes a polypeptide that modifies a plant's structure and 
development characteristics; 

(h) a nucleotide sequence having at least 3 1% sequence identity to a nucleotide sequence 
20 of any of (a)-(g); 

(i) a nucleotide sequence having at least 60% identity sequence identity to a nucleotide 
sequence of any of (a)-(g); 

(j) a nucleotide sequence which encodes a polypeptide having at least 31% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; 
25 (k) a nucleotide sequence which encodes a polypeptide having at least 60% identity 

sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; and 
(1) a nucleotide sequence which encodes a polypeptide having at least 65% sequence 
identity to a conserved domain of a polypeptide of SEQ ID Nos. 2N, where N=l-23. 

30 2. The transgenic plant of claim 1, further comprising a constitutive, inducible, or tissue- 
active promoter operably linked to said nucleotide sequence, 

3. The transgenic plant of claim 1, wherein the plant is selected from the group consisting 
of: soybean, wheat, com, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
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banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, 
cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
pineapple, spinach, squash, sweet com, tobacco, tomato, watermelon, rosaceous fruits, and 
vegetable brassicas. 

5 

4. An isolated or recombinant polynucleotide comprising a nucleotide sequence selected 

from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising a sequence selected from 
SEQ ID Nos, 2N, where N=l-23, or a complementary nucleotide sequence thereof; 
10 (b) a nucleotide sequence encoding a polypeptide comprising a conservatively substituted 

variant of a polypeptide of (a); 

(c) a nucleotide sequence comprising a sequence selected from those of SEQ ID Nos. 2N- 
1, where N=l-23, or a complementary nucleotide sequence thereof; 

(d) a nucleotide sequence comprising silent substitutions in a nucleotide sequence of (c); 
15 (e) a nucleotide sequence which hybridizes under stringent conditions to a nucleotide 

sequence of one or more of: (a), (b), (c), or (d); 

(f) a nucleotide sequence comprising at least 15 consecutive nucleotides of a sequence of 
any of (a)-(e); 

(g) a nucleotide sequence comprising a subsequence or fragment of any of (a)-(f), which 
20 subsequence or fragment encodes a polypeptide that modifies a plant's structure and 

development characteristics; 

(h) a nucleotide sequence having at least 31% sequence identity to a nucleotide sequence 
of any of (a)-(g); 

(i) a nucleotide sequence having at least 60% identity sequence identity to a nucleotide 
25 sequence of any of (a)-(g); 

(j) a nucleotide sequence which encodes a polypeptide having at least 31% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; 
(k) a nucleotide sequence which encodes a polypeptide having at least 60% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; and 
30 (1) a nucleotide sequence which encodes a conserved domain of a polypeptide having at 

least 65% sequence identity to a conserved domain of a polypeptide of SEQ ID Nos. 2N, 
where N= 1-23. 
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5. The isolated or recombinant polynucleotide of claim 4, further comprising a constitutive, 
inducible, or tissue-active promoter operably linked to the nucleotide sequence, 

6. A cloning or expression vector comprising the isolated or recombinant polynucleotide of 
claim 4. 

7. . A cell comprising the cloning or expression vector of claim 6. 

8. A transgenic plant comprising the isolated or recombinant polynucleotide of claim 4. 

9. A composition produced by one or more of: 

(a) incubating one or more polynucleotide of claim 4 with a nuclease; 

(b) incubating one or more polynucleotide of claim 4 with a restriction enzyme; 

(c) incubating one or more polynucleotide of claim 4 with a polymerase; 

(d) incubating one or more polynucleotide of claim 4 with a polymerase and a primer; 

(e) incubating one or more polynucleotide of claim 4 with a cloning vector, or 

(f) incubating one or more polynucleotide of claim 4 with a cell. 

10. A composition comprising two or more different polynucleotides of claim 4. 

11. An isolated or recombinant polypeptide comprising a subsequence of at least about 15 
contiguous amino acids encoded by the recombinant or isolated polynucleotide of claim 4. 

12. A plant ectopically expressing an isolated polypeptide of claim 1 1. 

13. A method for producing a plant having a modified structure and development 
characteristic, the method comprising altering the expression of the isolated or recombinant 
polynucleotide of claim 4 or the expression levels or activity of a polypeptide of claim 1 1 in a 
plant, thereby producing a modified plant, and selecting the modified plant for modified structure 
and development characteristics thereby providing the modified plant with a modified structure 
and development characteristics. 

14. The method of claim 13, wherein the polynucleotide is a polynucleotide of claim 4. 
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15. A method of identifying a factor that is modulated by or interacts with a polypeptide 
encoded by a polynucleotide of claim 4, the method comprising: 

(a) expressing a polypeptide encoded by the polynucleotide in a plant; and 

(b) identifying at least one factor that is modulated by or interacts with the polypeptide. 

5 

16. The method of claim 15, wherein the identifying is performed by detecting binding by the 
polypeptide to a promoter sequence, or detecting interactions between an additional protein and 
the polypeptide in a yeast two hybrid system. 

10 17. The method of claim 15, wherein the identifying is performed by detecting expression of 
a factor by hybridization to a microarray, subtractive hybridization or differential display. 

18. A method of identifying a molecule that modulates activity or expression of a 
polynucleotide or polypeptide of interest, the method comprising: 

1 5 (a) placing the molecule in contact with a plant comprising the polynucleotide or 

polypeptide encoded by the polynucleotide of claim 4; and, 
(b) monitoring one or more of: 

(i) expression level of the polynucleotide in the plant; 

(ii) expression level of the polypeptide in the plant; 

20 (iii) modulation of an activity of the polypeptide in the plant; or 

(iv) modulation of an activity of the polynucleotide in the plant. 

19. An integrated system, computer or computer readable medium comprising one or more 
character strings corresponding to a polynucleotide of claim 4, or to a polypeptide encoded by the 

25 polynucleotide. 

20. The integrated system, computer or computer readable medium of claim 19, further 
comprising a link between said one or more sequence strings to a modified plant structure and 
development characteristics phenotype. 

30 

21. A method of identifying a sequence similar or homologous to one or more 
polynucleotides of claim 4, or one or more polypeptides encoded by the polynucleotides, the 
method comprising: 

(a) providing a sequence database; and, 
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(b) querying the sequence database with one or more target sequences corresponding to 
the one or more polynucleotides or to the one or more polypeptides to identify one or 
more sequence members of the database that display sequence similarity or homology to 
one or more of the one or more target sequences. 

5 

22, The method of claim 21, wherein the querying comprises aligning one or more of the 
target sequences with one or more of the one or more sequence members in the sequence 
database. 

10 23. The method of claim 21, wherein the querying comprises identifying one or more of the 
one or more sequence members of the database that meet a user-selected identity criteria with one 
or more of the target sequences. 

24. The method of claim 21, further comprising linking the one or more of the 

1 5 polynucleotides of claim 4, or encoded polypeptides, to a modified plant structure and 
development characteristics phenotype. 

25. A plant comprising altered expression levels of an isolated or recombinant polynucleotide 
of claim 4. 

20 

26. A plant comprising altered expression levels or the activity of an isolated or recombinant 
polypeptide of claim 1 1 . 

27. A plant lacking a nucleotide sequence encoding a polypeptide of claim 1 1 . 

25 
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Figure 1 



SEQ ID No. 


GID 


cDNA or ^otein 


conserved domain 


1 


G727 


cDNA 




2 


G727 


protein 


226-269 


3 


G732 


cDNA 




4 


G732 


protein 


31-9 


5 


G9 


cDNA 




6 


G9 


protein 


62-127 


7 


G428 


cDNA 




8 


G428 


protein 


229-292 


9 


G869 


cDNA 




10 


G869 


protein 


109-177 


11 


G1269 


cDNA 




12 


G1269 


protein 


27-83 


13 


G1038 


cDNA 




14 


G1038 


protein 


198-247 


15 


G438 


cDNA 




16 


G438 


protein 


22-85 


17 


G571 


cDNA 




18 


G571 


protein 


160-220 


19 


G748 


cDNA 




20 


G748 


protein 


112-140 


21 


G431 


cDNA 




22 


G431 


protein 


286-335 


23 


G187 


cDNA 




24 


G187 


protein 


172-228 


25 


G470 


cDNA 




26 


G470 


protein 


61-393 


27 


G615 


cDNA 




28 


G615 


protein 


88-147 


29 


G1073 


cDNA 




30 


G1073 


protein 


33-42. 78-175 
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Figure 2 



SEQ ID No. 


GID 


homolog 


cDNA or protein 


conserved domain 


31 


G1493 


homolog of G727 


cDNA 




32 


G1493 


homolog of G727 


protein 


242-289 


33 


G993 


homolog of G9 


cDNA 




34 


G993 


homolog of G9 


protein 


69-134 


35 


G867 


homolog of G9 


cDNA 




36 


G867 


homolog of G9 


protein 


59-124 


37 


G1930 


homolog of G9 


cDNA 




38 


G1930 


homolog of G9 


protein 


59-124 


39 


G1594 


homolog of G428 


cDNA 




40 


G1594 


homolog of G428 


protein 


262-325 


41 


G391 


homolog of G438 


cDNA 




42 


G391 


homolog of G438 


protein 


25-85 


43 


G390 


homolog of G438 


cDNA 




44 


G390 


homolog of G438 


protein 


18-81 


45 


G1548 


homolog of G438 


cDNA 




46 


G1548 


homolog of G438 


protein 


17-77 
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Figure 3A 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


1 


G727 


7283684 


2.20E-56 


Glycine max 




G727 


7206180 


8.40E-42 


Medicago truncatula 




G727 


7614196 


2.20E-40 


Lotus japonicus 


1 


G727 


572293 


1.20E-31 


Oryza sativa 


1 


G727 


7218448 


7.70E-30 


Sorghum bicolor 


1 


G727 


9291284 


1.80E-27 


Lycopersicon hirsutum 


1 


G727 


8901641 


5.10E-27 


Hordeum vulgare 


1 


G727 


8380453 


6.60 E-24 


Gossypium arboreum 


1 


G727 


9962201 


2.10E-12 


Cryptomeria japonica 


1 


G727 


8122498 


3.10E-08 


Lycopersicon esculentum 


3 


G732 


5048074 


5.60E-30 


Gossypium hirsutum 


3 


G732 


4384142 


6.10E-30 


Lycopersicon esculentum 


3 


G732 


7623218 


6.10E-30 


Gossypium arboreum 


3 


G732 


4457220 


1 .80E-29 


Capsicum chinense 


3 


G732 


7284989 


4.50E-28 


Glycine max 


3 


G732 


9650827 


1 .20E-27 


Petroselinum crispum 


3 


G732 


7205618 


2.20E-26 


Medicago truncatula 


3 


G732 


3854258 


1 .40E-22 


Populus tremula x Populus tremuloides 


5 


G9 


7643366 


6.80E-56 


Medicago truncatula 


5 


G9 


8669779 


4.20E-50 


Glycine max 


5 


G9 


8329389 


1 .50E-48 


Mesembryanthemum crystallinum 


5 


G9 


9851335 


3.50E-42 


Sorghum bicolor 


5 


G9 


7412012 


1.50E-41 


Lycopersicon esculentum 


5 


G9 


10450225 


1 ,30E-38 


Solanum tuberosum 


5 


G9 


8902194 


8.30E-36 


Hordeum vulgare 


5 


G9 


7722547 


2.60E-33 


Lotus japonicus 


5 


G9 


9696857 


1 .90E-32 


Triticum aestivum 


5 


G9 


7324245 


2.40E-32 


Lycopersicon pennellii 


7 


G428 


3327268 


5.50E-65 


Ipomoea nil 


7 


G428 


4589883 


1 .20E-60 


Nicotiana tabacum 


7 


G428 


1814233 


2.20E-56 


Solanum tuberosum 


7 


G428 


7581978 


8.50E-56 


Dendrobium grex Madame Thong-In 


7 


G428 


4098241 


1 .50E-53 


Lycopersicon esculentum 


7 


G428 


4099825 


1 .30E-38 


Picea marlana 


7 


G428 


346261 1 


2.50E-38 


Pisum sativum 


7 


G428 


3928842 


1.90E-37 


Picea abies 


7 


G428 


9699343 


2.70E-35 


Triticum aestivum 


7 


G428 


1008878 


4.80E-35 


Zea mays 


9 


G869 


10235055 


1.00E-19 


Glycine max 


9 


G869 


2213784 


1.60E-19 


Lycopersicon esculentum 


9 


G869 


3065894 


9.20E-19 


Nicotiana tabacum 


9 


G869 


8570080 


5.30E-18 


Oryza sativa 


9 


G869 


7560260 


1.90E-17 


Medicago truncatula 


9 


G869 


9850452 


9.30E-16 


Sorghum bicolor 


9 


G869 


9963144 


1 .10E-13 


Cryptomeria japonica 


9 


G869 


9660634 


1.90E-13 


Secale cereale 


9 


G869 


9362061 


3.40E-13 


Triticum aestivum 


9 


G869 


7788764 


7.20E-13 


Lotus japonicus 


11 


G1269 


9565366 


7.00E-37 


Glycine max 


11 


G1269 


5272360 


8.10E-37 


Lycopersicon esculentum 


11 


G1269 


9119112 


8.40E-28 


Medicago truncatula 


11 


G1269 


9852711 


2.10E-22 


Sorghum bicolor 
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Figure 3B 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


11 


G1269 


9255178 


1.10E-18 


Zea mays 


11 


G1269 


10447957 


8.60E-15 


Solanum tuberosum 


11 


G1269 


9435251 


1 .20E-09 


Hordeum vulgare 


11 


G1269 


3858030 


3.20E-09 


Populus balsamifera subsp. trichocarpa 


11 


G1269 


9696112 


3.80E-09 


Triticum aestivum 


11 


G1269 


8213273 


4.90E-09 


Oryza sativa 


13 


G1038 


8748344 


8.00E-37 


Medicago truncatula 


13 


G1038 


7283684 


5.20E-36 


Glycine max 


13 


G1038 


7218448 


8.80E-36 


Sorghum bicolor 


13 


G1038 


572293 


3.30E-35 


Oryza sativa 


13 


G1038 


8901641 


4.30E-28 


Hordeum vulgare 


13 


G1038 


9962201 


2.20E-16 


Cryptomeria japonica 


13 


G1038 


7614196 


6.50E-11 


Lotus japonicus 


13 


G1038 


9291272 


0.00015 


Lycopersicon hirsutum 


13 


G1038 


8122498 


0.0005 


Lycopersicon esculentum 


13 


G1038 


9883662 


0.68 


Triticum aestivum 


15 


G438 


7209474 


8.70E-204 


Oryza sativa 


15 


G438 


7209911 


2.20E-142 


Physcomitrella patens 


15 


G438 


7571387 


2.30E-80 


Medicago truncatula 


15 


G438 


8330425 


3.00E-66 


Mesembryanthemum crystallinum 


15 


G438 


6531152 


1 .60E-64 


Lycopersicon esculentum 


15 


G438 


6726825 


4.70E-61 


Glycine max 


15 


G438 


5269007 


7.00E-54 


Zea mays 


15 


G438 


9253000 


1 .70E-47 


Solanum tuberosum 


15 


G438 


8967371 


4.40E-46 


Hordeum vulgare 


15 


G438 


2963336 


1 .60E-34 


Pinus taeda 


17 


G571 


6288681 


1 .50E-70 


Nicotiana tabacum 


17 


G571 


297019 


1 .60E-68 


Zea mays 


17 


G571 


10423526 


2.20E-61 


Oryza sativa 


17 


G571 


5926681 


4.20E-61 


Triticum aestivum 


17 


G571 


4959969 


1.90E-59 


Lycopersicon esculentum 


17 


G571 


1372965 


1 .20E-56 


Vicia faba 


17 


G571 


8098832 


1 .20E-46 


Hordeum vulgare 


17 


G571 


9566058 


2.00E-43 


Glycine max 


17 


G571 


765198 


1.50E-41 


Solanum tuberosum 


17 


G571 


19679 


3.80E-41 


Nicotiana sp. 


19 


G748 


853689 


7.00E-87 


Cucurbita maxima 


19 


G748 


7242897 


3.90E-59 


Oryza sativa 


19 


G748 


5888560 


1 .20E-45 


Lycopersicon esculentum 


19 


G748 


6341666 


5.60E-38 


Glycine max 


19 


G748 


10700058 


1.10E-36 


Medicago truncatula 


19 


G748 


7535776 


5.00E-33 


Sorghum bicolor 


19 


G748 


9419494 


2.10E-31 


Hordeum vulgare 


19 


G748 


9410157 


1.00E-28 


Triticum aestivum 


19 


G748 


3929324 


4.30E-25 


Dendrobium grex Madame Thong-IN 


19 


G748 


1 0449922 


2.30E-23 


Solanum tuberosum 


21 


G431 


7340349 


9.90E-177 


Brassica oleracea 


21 


G431 


3462611 


1.20E-112 


Pisum sativum 


21 


G431 


310568 


1.50E-112 


Glycine max 


21 


G431 


2251078 


1.90E-107 


Nicotiana tabacum 


21 


G431 


4098239 


1.20E-104 


Lycopersicon esculentum 


21 


G431 


1008878 


4.90E-62 


Zea mays 


21 


G431 


6942299 


7.90E-62 


Triticum aestivum 
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Figure 3C 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


21 


G431 


3327239 


1.90E-61 


Oryza sativa 


21 


G431 


3928842 


1.60E-59 


Picea abies 


21 


G431 


2522483 


2.30E-59 


Hordeum vulgare 


23 


G187 


9304207 


2.10E-35 


Sorghum bicolor 


23 


G187 


9444636 


3.20E-34 


Triticum aestivum 


23 


G187 


5058292 


3.60E-34 


Glycine max 


23 


G187 


7721184 


2.40E-32 


Lotus japonicus 


23 


G187 


7562279 


1 .20E-31 


Medicago truncatula 


23 


G187 


8105974 


3.00E-29 


Lycopersicon esculentum 


23 


G187 


9049477 


1 .60E-27 


Oryza sativa 


23 


G187 


9187621 


1 .60E-23 


Solanum tuberosum 


23 


G187 


5268376 


5.60E-23 


Zea mays 


23 


G187 


4894964 


1 .70E-22 


Avena sativa 


25 


G470 


6917173 


4.80E-78 


Lycopersicon pennellii 


25 


G470 


8827792 


8.50E-70 


Glycine max 


25 


G470 


5272309 


7.40E-69 


Lycopersicon esculentum 


25 


G470 


7563870 


6.70E-68 


Medicago truncatula 


25 


G470 


5296108 


5.50E-65 


Zea mays 


25 


G470 


7339690 


7.40E-57 


Oryza sativa 


25 


G470 


5047367 


1.30E-51 


Gossypium hirsutum 


25 


G470 


9856054 


9.70E-50 


Sorghum bicolor 


25 


G470 


3857884 


1.10E-38 


Populus balsamifera subsp. trichocarpa 


25 


G470 


8174666 


6.40E-37 


Hordeum vulgare 


27 


G615 


5566284 


2.00E-28 


Linaria vulgaris 


27 


G615 


6358617 


3.20E-27 


Antirrhinum graniticum 


27 


G615 


6358613 


1 .40E-26 


Antirrhinum majus subsp. cirrhigerum 


27 


G615 


6358545 


8.60E-26 


Digitalis purpurea 


27 


G615 


6358538 


1 .40E-25 


Antirrhinum braun-blanquetii 


27 


G615 


6358541 


1 .40E-25 


Misopates orontium 


27 


G615 


6358542 


1.40E-25 


Antirrhinum molle 


27 


G615 


6358573 


1.40E-25 


Misopates calycinum 


27 


G615 


6358546 


1 .80E-25 


Antirrhinum siculum 


27 


G615 


2826867 


2.70E-25 


Antirrhinum majus 


29 


G1073 


7238733 


2.70E-55 


Medicago truncatula 


29 


G1073 


10843924 


1 .50E-44 


Glycine max 


29 


G1073 


7615218 


2.00E-42 


Lotus japonicus 


29 


G1073 


7333102 


3.40E-34 


Lycopersicon esculentum 


29 


G1073 


9689692 


8.60E-28 


Pinus taeda 


29 


G1073 


9445090 


4.30E-25 


Triticum aestivum 


29 


G1073 


9252370 


2.80E-24 


Solanum tuberosum 


29 


G1073 


5042437 


5.80E-21 


Oryza sativa 


29 


G1073 


7536402 


6.70E-20 


Sorghum bicolor 


29 


G1073 


9662742 


2.70E-19 


Secale cereale 


31 


G1493 


7614196 


2.20E-50 


Lotus japonicus 


31 


G1493 


9986889 


6.10E-48 


Glycine max 


31 


G1493 


8748344 


2.20E-38 


Medicago truncatula 


31 


G1493 


572293 


1 ,70E-37 


Oryza sativa 


31 


G1493 


7218448 


5.70E-33 


Sorghum bicolor 


31 


G1493 


9291284 


9.70E-32 


Lycopersicon hirsutum 


31 


G1493 


8380453 


1 .60E-30 


Gossypium arboreum 


31 


G1493 


8901641 


1.70E-30 


Hordeum vulgare 


31 


G1493 


9962201 


6.90E-17 


Cryptomeria japonica 


31 


G1493 


8122498 


1 .50E-08 


Lycopersicon esculentum 
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Figure 3D 



cpr> in Kin 




vj6nDani\ INIU 






33 


G993 


7643366 


1 .20E-58 


Medicago truncatula 


33 


G993 


8329389 


1 .OOE-49 


Mesembryanthemum crystallinum 


33 


G993 


8669779 


D.10E-49 


Glycine max 


33 


G993 


9851335 


6.30E-43 


Sorghum bicolor 


33 


G993 


4384549 


5.20E-40 


Lycopersicon esculentum 


33 


G993 


10450225 


3.70E-39 


Solanum tuberosum 


33 


G993 


8902194 


2.50E-34 


Hordeum vulgare 


33 


G993 


7719409 


1 .30E-32 


Lotus japonicus 


33 


G993 


8749037 


5.20E-32 


Citrus X paradisi 


33 


G993 


9247126 


1 .30E-30 


Oryza sativa 


35 


G867 


7643366 


2.20E-57 


Medicago truncatula 


35 


G867 


8329389 


1 .10E-50 


Mesembryanthemum crystallinum 


35 


G867 


8669779 


2.70E-46 


Glycine max 


35 


G867 


1 0450225 


3.60E-41 


Solanum tuberosum 


35 


G867 


9851335 


2.80E-40 


Sorghum bicolor 


35 


G867 


9430646 


7.20E-40 


Lycopersicon esculentum 


35 


G867 


8902194 


1 .60E-34 


Hordeum vulgare 


35 


G867 


7722547 


1 .30E-33 


Lotus japonicus 


35 


G867 


7324245 


3.90E-32 


Lycopersicon pennellii 


35 


G867 


8749037 


1 .40E-31 


Citrus X paradisi 


37 


G1930 


7643366 


9.70E-57 


Medicago truncatula 


37 


G1930 


8329389 


4.50E-47 


Mesembryanthemum crystallinum 


37 


G1930 


6069592 


1 .10E-46 


Glycine max 


37 


G1930 


1 0450225 


6.50E-42 


Solanum tuberosum 


37 


G1930 


9430646 


8.20E-39 


Lycopersicon esculentum 


37 


G1930 


9851335 


1 .80E-38 


Sorghum bicolor 


37 


G1930 


7722547 


4.70E-34 


Lotus japonicus 


37 


01 930 


7324245 


1 .20E-32 


Lycopersicon pennellii 


37 


G1930 


8902194 


3.00E-31 


Hordeum vulgare 


37 


G1930 


9697984 


4.60E-29 


Triticum aestivum 


39 


G1594 


3327268 


2.60E-74 


Ipomoea nil 


39 


G1594 


7581978 


9.20E-62 


Dendrobium grex Madame Thong-In 


39 


G1594 


4887609 


1 .50E-47 


Oryza sativa 


39 


G1594 


1814233 


4.00E-46 


Solanum tuberosum 


39 


G1594 


4589883 


6.30E-43 


Nicotiana tabacum 


39 


G1594 


A r\r\n*^ a a 

4098241 


6.70E-43 


Lycopersicon esculentum 


39 


G1594 


3928842 


2.00E-42 


Picea abies 


39 


G1594 


4099825 


2.60E-42 


Picea mariana 


39 


G1594 


4240538 


1.70E-41 


Zea mays 


39 


G1594 


1946219 


1 .90E-41 


Malus domestica 


41 


G391 


7209474 


4.70E-194 


Oryza sativa 


41 


G391 


720991 1 


2.10E-145 


Physcomitrella patens 


41 


G391 


7560927 


8.70E-67 


Medicago truncatula 


41 


G391 


10808354 


1 .50E-61 


Solanum tuberosum 


41 


G391 


5893826 


7.00E-60 


Lycopersicon esculentum 


41 




oooU'l-ZO 


o.Dut-oy 


iviesemDryaninemum crysiaiiinum 


41 


G391 


8284059 


8.70E-57 


Glycine max 


41 


G391 


5269007 


8.10E-46 


Zea mays 


41 


G391 


9419425 


1.70E-43 


Hordeum vulgare 


41 


G391 


2963336 


2.10E-37 


Pinus taeda 


43 


G390 


7209474 


2.50E-166 


Oryza sativa 


43 


G390 


7209911 


1.70E-149 


Physcomitrella patens 


43 


G390 


7560927 


5.80E-81 


Medicago truncatula 
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Figure 3E 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


43 


G390 


7409018 


1 .80E-68 


Lycopersicon esculentum 


43 


G390 


8071613 


3.00E-60 


Solanum tuberosum 


43 


G390 


9466042 


1 .60E-59 


Hordeum vulgare 


43 


G390 


8284059 


1 .OOE-57 


Glycine max 


43 


G390 


8330425 


2.60E-44 


Mesembryanthemum crystallinum 


43 


G390 


5269007 


4.60E-44 


Zea mays 


43 


G390 


2963336 


4.90E-43 


Pinus taeda 


45 


G1548 


7209474 


5.90E-169 


Oryza sativa 


45 


G1548 


7209911 


3.30E-140 


Physcomitrella patens 


45 


G1548 


9253000 


1 .60E-76 


Solanum tuberosum 


45 


G1548 


9820423 


1 .40E-67 


Glycine max 


45 


G1548 


7570825 


8.40E-67 


Medicago truncatula 


45 


G1548 


9456848 


2.70E-55 


Lycopersicon esculentum 


45 


G1548 


9419425 


1 .40E-47 


Hordeum vulgare 


45 


G1548 


6626571 


3.50E-46 


Zea mays 


45 


G1548 


8330425 


4.20E-46 


Mesembryanthemum crystallinum 


45 


G1548 


3853847 


2.70E-42 


Populus tremula x Populus tremuloides 
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MBI0018 Sequence Listing. ST25 
SEQUENCE LISTING 



<110> Riechmann, Jose Luis 
Reuber, Lynne 
Keddie, James 
Ratcliffe, Oliver 
Heard, Jacqueline 
Samaha, Raymond 
Yu, Guo- Liang 
Jiang, Cai-Zhong 

<:120> Plant Developmental Genes 



<130> MBI-0018 



<150> 60/164,132 
<151> 1999-11-17 



<150> 60/197,899 
<151> 2000-04-17 



<150> Plant Trait Modification III 

<151> 2000-08-22 

<160> 46 

<170> Patentin version 3.0 



<210> 1 

<211> 2007 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (43) . . (1977) 

<223> G727 

<400> 1 

cttcttctcc ttctctgatc gttcgttttc tggacgagag ag atg gta aat ccg 54 

Met Val Asn Pro 
1 

ggt cac gga aga gga ccc gat teg ggt act get get ggt ggg tea aae 102 
Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala Gly Gly Ser Asn 
5 10 15 20 

tec gae ccg ttt cet geg aat ett cga gtt ctt gtc gtt gat gat gat 150 
Ser Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val Val Asp Asp Asp 
25 30 35 

cca act tgt etc atg ate tta gag agg atg ett atg act tgt etc tac 198 
Pro Thr Cys Leu Met lie Leu Glu Arg Met Leu Met Thr Cys Leu Tyr 
40 45 50 

aga gag eag aga gcg cat tgt etc tgc tte gga aga aca aag aat ggt 246 
Arg Glu Gin Arg Ala His Cys Leu Cys Phe Gly Arg Thr Lys Asn Gly 
55 60 65 

ttt gat att gte att agt gat gtt eat atg cet gae atg gat ggt tte 294 
Phe Asp lie Val lie Ser Asp Val His Met Pro Asp Met Asp Gly Phe 
70 75 80 

aag etc ctt gaa cac gtt ggt tta gag atg gat tta cet gtt ate aat 342 
Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val lie Asn 
85 90 95 100 

ctg aat gtt ttg aaa cet ttg gtt ata gtg atg tct gcg gat gat teg 390 
Leu Asn Val Leu Lys Pro Leu Val lie Val Met Ser Ala Asp Asp Ser 
105 110 115 

aag age gtt gtg ttg aaa gga gtg act cac ggt gca gtt gat tac etc 438 
Lys Ser Val Val Leu Lys Gly Val Thr His Gly Ala Val Asp Tyr Leu 
120 125 130 
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ate aaa ccg gta cgt att gag get ttg aag aat ata tgg caa cat gtg 

lie Lys Pro Val Arg He Glu Ala Leu Lys Asn He Trp Gin His Val 
135 140 145 

gtg egg aag aag cgt aac gag tgg aat gtt tct gaa cat tct gga gga 

Val Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu His Ser Gly Gly 
150 155 160 

agt att gaa gat act ggc ggt gac agg gac agg cag cag cag cat agg 

Ser He Glu Asp Thr Gly Gly Asp Arg Asp Arg Gin Gin Gin His Arg 

165 170 175 180 

gag gat get gat aac aac teg tct tea gtt aat gaa ggg aac ggg agg 

Glu Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu Gly Asn Gly Arg 

185 190 195 

age teg agg aag egg aag gaa gag gaa gta gat gat caa ggg gat gat 

Ser Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp Gin Gly Asp Asp 
200 205 210 

aag gaa gac tea teg agt tta aag aaa eca egc gtg gtt tgg tct gtt 

Lys Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val 
215 220 225 

gaa ttg cat cag cag ttt gtt get get gtg aat cag eta ggc gtt gac 

Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin Leu Gly Val Asp 
230 235 240 

agt gag tta aaa act tge ttg ctt atg cat ttg tgt gtg teg att ggt 

Ser Glu Leu Lys Thr Cys Leu Leu Met His Leu Cys Val Ser lie Gly 

245 250 255 260 

aac att gtg gaa ttc cag aag tat egg ata tat etg aga egg ctt gga 

Asn He Val Glu Phe Gin Lys Tyr Arg He Tyr Leu Arg Arg Leu Gly 

265 270 275 

gga gta teg caa cae caa gga aat atg aac eat teg ttt atg act ggt 

Gly Val Ser Gin His Gin Gly Asn Met Asn His Ser Phe Met Thr Gly 
280 285 290 

caa gat cag agt ttt gga cct ctt tct teg ttg aat gga ttt gat ctt 

Gin Asp Gin Ser Phe Gly Pro Leu Ser Ser Leu Asn Gly Phe Asp Leu 
295 300 305 

caa tct tta get gtt act ggt cag etc cct cct cag age ctt gea cag 

Gin Ser Leu Ala Val Thr Gly Gin Leu Pro Pro Gin Ser Leu Ala Gin 
310 315 320 

ctt caa gea get ggt ctt ggc egg cct aca etc get aaa eca ggg atg 

Leu Gin Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala Lys Pro Gly Met 

325 330 335 340 

teg gtt tct ece ctt gta gat cag aga age ate ttc aac ttt gaa aac 

.Ser Val Ser Pro Leu Val Asp Gin Arg Ser He Phe Asn Phe Glu Asn 

345 350 355 

eca aaa ata aga ttt gga gac gga eat ggt cag aeg atg aac aat gga 

Pro Lys He Arg Phe Gly Asp Gly His Gly Gin Thr Met Asn Asn Gly 
360 365 370 

aat ttg ctt cat ggt gtc eca aeg ggt agt cac atg cgt etg cgt cct 

Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met Arg Leu Arg Pro 
375 380 385 

gga cag aat gtt cag age age gga atg atg ttg eca gta gea gac cag 

Gly Gin Asn Val Gin Ser Ser Gly Met Met Leu Pro Val Ala Asp Gin 

390 395 400 

eta cct ega gga gga eca teg atg eta eca tee etc ggg caa cag ccg 

Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu Gly Gin Gin Pro 

405 410 415 420 

ata ttg tea age age gtt tea aga aga age gat etc act ggt gcg etg 

He Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu Thr Gly Ala Leu 
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486 

534 

582 

630 

678 

726 

774 

822 

870 

918 

966 
1014 
1062 
1110 
1158 
1206 
1254 
1302 
1350 
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425 430 435 

gcg gtt aga aac agt ate ccc gag acc aac age aga gtg tta cca act 1398 
Ala Val Arg Asn Ser lie Pro Glu Thr Asn Ser Arg Val Leu Pro Thr 
440 445 450 

act cac teg gtc ttc aat aac ttc ccc gcg gat eta act cgc age age 144 6 

Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu Pro Arg Ser Ser 
455 460 465 

ttc ccg ttg gca agt gee cca ggg att tea gtt cca gta tea gtt tct 1494 
Phe Pro Leu Ala Ser Ala Pro Gly lie Ser Val Pro Val Ser Val Ser 
470 475 480 

tac caa gaa gag gtc aac age teg gat gca aaa gga ggt tea tea get 154 2 

Tyr Gin Glu Glu Val Asn Ser Ser Asp Ala Lys Gly Gly Ser Ser Ala 

485 490 495 500 

get act get gga ttt ggt aac cca age tac gae ata ttt aac gat ttt 1590 
Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp lie Phe Asn Asp Phe 
505 510 . 515 

ccg cag cac caa eag cae aac aag aac ate age aat aaa eta aac gat 1638 
Pro Gin His Gin Gin His Asn Lys Asn lie Ser Asn Lys Leu Asn Asp 
520 525 530 

tgg gat etg egg aat atg gga ttg gte tte agt tec aat eag gae gca 1686 
Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser Asn Gin Asp Ala 
535 540 545 

gca act gea acc gca acc gca gca ttt tee act teg gaa gca tac tct 1734 
Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser Glu Ala Tyr Ser 
550 555 560 

teg tct tct acg eag aga aaa aga egg gaa acg gae gca aca gtt gtg 1782 
Ser Ser Ser Thr Gin Arg Lys Arg Arg Glu Thr Asp Ala Thr Val Val 
565 570 575 580 

9gt gag cat ggg cag aac ctg cag tea ccg age egg aat ctg tat eat 183 0 

Gly Glu His Gly Gin Asn Leu Gin Ser Pro Ser Arg Asn Leu Tyr His 
585 590 595 

ctg aac cac gtt ttt atg gae ggt ggt tea gtc aga gtg aag tea gaa 1878 
Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg Val Lys Ser Glu 
600 605 610 

aga gtg gcg gag aca gtg act tgt cet cca gca aat aca ttg ttt cac 1926 
Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn Thr Leu Phe His 

615 620 625 

gag cag tat aat caa gaa gat ctg atg age gca ttt etc aaa cag gtt 1974 
Glu Gin Tyr Asn Gin Glu Asp Leu Met Ser Ala Phe Leu Lys Gin Val 
630 635 640 

tga ttattacteg aatacagtgc actctaaaac 2007 



<210> 2 
<211> 644 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Val Asn Pro Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala 
15 10 15 



Gly Gly Ser Asn Ser Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val 

20 25 30 

Val Asp Asp Asp Pro Thr Cys Leu Met lie Leu Glu Arg Met Leu Met 
35 40 45 
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Thr Cys Leu Tyr Arg Glu Gin Arg Ala His Cys Leu Cys Phe Gly Arg 
50 55 60 

Thr Lys Asn Gly Phe Asp lie Val lie Ser Asp Val His Met Pro Asp 
65 70 75 80 

Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu 
85 90 95 

Pro Val lie Asn Leu Asn Val Leu Lys Pro Leu Val lie Val Met Ser 
100 105 110 

Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val Thr His Gly Ala 
115 120 125 

Val Asp Tyr Leu lie Lys Pro Val Arg lie Glu Ala Leu Lys Asn lie 
130 135 140 

Trp Gin His Val Val Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu 
145 150 155 160 

His Ser Gly Gly Ser lie Glu Asp Thr Gly Gly Asp Arg Asp Arg Gin 
165 170 175 

Gin Gin His Arg Glu Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu 
180 185 190 

Gly Asn Gly Arg Ser Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp 
195 200 205 

Gin Gly Asp Asp Lys Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val 
210 215 220 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
225 230 235 240 

Leu Gly Val Asp Ser Glu Leu Lys Thr Cys Leu Leu Met His Leu Cys 
245 250 255 

Val Ser lie Gly Asn lie Val Glu Phe Gin Lys Tyr Arg lie Tyr Leu 
260 265 270 

Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn Met Asn His Ser 
275 280 285 

Phe Met Thr Gly Gin Asp Gin Ser Phe Gly Pro Leu Ser Ser Leu Asn 
290 295 300 

Gly Phe Asp Leu Gin Ser Leu Ala Val Thr Gly Gin Leu Pro Pro Gin 
305 310 315 320 

Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala 
325 330 335 

Lys Pro Gly Met Ser Val Ser Pro Leu Val Asp Gin Arg Ser lie Phe 
340 345 350 

Page 4 



wo 01/36444 



PCT/USOO/31325 



MBI0 018 Sequence Listing.ST25 

Asn Phe Glu Asn Pro Lys He Arg Phe Gly Asp Gly His Gly Gin Thr 
355 360 365 

Met Asn Asn Gly Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met 
370 375 380 

Arg Leu Arg Pro Gly Gin Asn Val Gin Ser Ser Gly Met Met Leu Pro 
385 390 395 400 

Val Ala Asp Gin Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu 
405 410 415 

Gly Gin Gin Pro He Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu 
420 425 430 

Thr Gly Ala Leu Ala Val Arg Asn Ser He Pro Glu Thr Asn Ser Arg 
435 440 445 

Val Leu Pro Thr Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu 
450 455 460 

Pro Arg Ser Ser Phe Pro Leu Ala Ser Ala Pro Gly He Ser Val Pro 
465 470 475 480 

Val Ser Val Ser Tyr Gin Glu Glu Val Asn Ser Ser Asp Ala Lys Gly 
485 490 495 

Gly Ser Ser Ala Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp He 
500 505 510 

Phe Asn Asp Phe Pro Gin His Gin Gin His Asn Lys Asn He Ser Asn 
515 520 525 

Lys Leu Asn Asp Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser 
530 535 540 

Asn Gin Asp Ala Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser 
545 550 555 560 

Glu Ala Tyr Ser Ser Ser Ser Thr Gin Arg Lys Arg Arg Glu Thr Asp 
565 570 575 

Ala Thr Val Val Gly Glu His Gly Gin Asn Leu Gin Ser Pro Ser Arg 
580 585 590 

Asn Leu Tyr His Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg 
595 600 605 

Val Lys Ser Glu Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn 
610 615 620 

Thr Leu Phe His Glu Gin Tyr Asn Gin Glu Asp Leu Met Ser Ala Phe 
625 630 635 640 

Leu Lys Gin Val 
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<210> 


3 


<211> 


834 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(73) . . (588) 


<223> 


G732 



<400> 3 

aaaaaaacca aacataaaac ataaaactct gtcctttttt tgtcttcttg taacttttct 60 

tgttaaaaat ca atg gcg tea tct age age aea tac egg age tea age tet 111 
Met Ala Ser Ser Ser Ser Thr Tyr Arg Ser Ser Ser Ser 
15 10 

tec gae ggt ggt aat aat aac ccg teg gae tec gtc gtc acc gtc gac 159 
Ser Asp Gly Gly Asn Asn Asn Pro Ser Asp Ser Val Val Thr Val Asp 
15 20 25 

gaa cga aaa cgt aaa aga atg tta teg aac aga gaa tct gca cgt agg 207 
Glu Arg Lys Arg Lys Arg Met Leu Ser Asn Arg Glu Ser Ala Arg Arg 
30 35 40 45 

tea agg atg cgt aaa cag aaa cac gtt gat gat eta acg get cag ate 255 
Ser Arg Met Arg Lys Gin Lys His Val Asp Asp Leu Thr Ala Gin lie 

50 55 60 

aat cag eta tea aac gac aac cgt cag ate ttg aac age etc acc gta 3 03 

Asn Gin Leu Ser Asn Asp Asn Arg Gin lie Leu Asn Ser Leu Thr Val 
65 70 75 

aea tct cag ett tac atg aag ate eaa gee gag aac tet gtt etc ace 351 
Thr Ser Gin Leu Tyr Met Lys lie Gin Ala Glu Asn Ser Val Leu Thr 
80 85 90 

get eag atg gag gag ett age acc aga etc eaa tct etc aac gag ate 399 
Ala Gin Met Glu Glu Leu Ser Thr Arg Leu Gin Ser Leu Asn Glu lie 
95 100 105 

gtt gat ett gtt eaa tee aac ggt gea gga ttt ggt gtt gae eag ate 447 
Val Asp Leu Val Gin Ser Asn Gly Ala Gly Phe Gly Val Asp Gin lie 
110 115 120 125 

gae gge tgt ggt ttt gat gat cgt aeg gtt ggg ate gae gga tat tac 495 
Asp Gly Cys Gly Phe Asp Asp Arg Thr Val Gly lie Asp Gly Tyr Tyr 
130 135 140 

gat gat atg aat atg atg agt aat gtt aat eat tgg ggt ggt teg gtt 543 
Asp Asp Met Asn Met Met Ser Asn Val Asn His Trp Gly Gly Ser Val 
145 150 155 

tac act aac caa ccc att atg get aat gat ate aat atg tat tga 588 
Tyr Thr Asn Gin Pro lie Met Ala Asn Asp lie Asn Met Tyr 
160 165 170 



ttaataaaat 


taattaaaat 


aattagatgc 


eccttttttg 


tetttttatt 


ttaaaattta 


648 


geccattttg 


gtgtttttgg 


gttggtgtga 


tgatgtaatt 


atagtaeatg 


eatetttgat 


708 


tggttggaag 


gataaatata 


aaetttatat 


atatattggg 


geatatatat 


atgagttgta 


768 


etttgeatgt 


attggtgtgt 


gttttgttat 


aattatatga 


ttatatatgt 


ttatgttaaa 


828 


aaaaaa 












834 



<210> 4 
<211> 171 
<212> PRT 
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<213> Arabidopsis thaliana 
<400> 4 

Met Ala Ser Ser Ser Ser Thr Tyr Arg Ser Ser Ser Ser Ser Asp Gly 
15 10 15 

Gly Asn Asn Asn Pro Ser Asp Ser Val Val Thr Val Asp Glu Arg Lys 
20 25 30 

Arg Lys Arg Met Leu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg Met 
35 40 45 

Arg Lys Gin Lys His Val Asp Asp Leu Thr Ala Gin lie Asn Gin Leu 
50 55 60 

Ser Asn Asp Asn Arg Gin He Leu Asn Ser Leu Thr Val Thr Ser Gin 
65 70 75 80 

Leu Tyr Met Lys He Gin Ala Glu Asn Ser Val Leu Thr Ala Gin Met 
85 90 95 

Glu Glu Leu Ser Thr Arg Leu Gin Ser Leu Asn Glu He Val Asp Leu 
100 105 110 

Val Gin Ser Asn Gly Ala Gly Phe Gly Val Asp Gin He Asp Gly Cys 
115 120 125 

Gly Phe Asp Asp Arg Thr Val Gly He Asp Gly Tyr Tyr Asp Asp Met 
130 135 140 

Asn Met Met Ser Asn Val Asn His Trp Gly Gly Ser Val Tyr Thr Asn 
145 150 155 160 

Gin Pro He Met Ala Asn Asp He Asn Met Tyr 
165 170 



<210> 


5 


<211> 


1246 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(81) . . (1139) 


<223> 


G9 


<400> 


5 



gtgtttcttc tttctgctaa aaggttataa tttttgtttc ttggtttggt gagaatcttc 60 

aagaaactga aacaaagaaa atg gat tct agt tgc ata gac gag ata agt tec 113 

Met Asp Ser Ser Cys He Asp Glu He Ser Ser 
15 10 

tec act tea gaa tct ttc tec gee ace ace gee aag aag etc tct cct 161 
Ser Thr Ser Glu Ser Phe Ser Ala Thr Thr Ala Lys Lys Leu Ser Pro 
15 20 25 

cct ccc gcg gcg gcg tta cgc etc tac egg atg gga age gge ggg age 2 09 

Pro Pro Ala Ala Ala Leu Arg Leu Tyr Arg Met Gly Ser Gly Gly Ser 
30 35 40 

age gtc gtg ttg gat ccc gag aac ggc eta gag aeg gag tea cga aag 257 
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Ser Val Val Leu Asp Pro Glu Asn Gly Leu Glu Thr Glu Ser Arg Lys 
45 50 55 

eta cca tct tea aaa tac aaa ggt gtt gtt cct cag cct aac gga aga 305 
Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro Asn Gly Arg 
60 65 70 75 

tgg gga get cag ate tac gag aag cae caa ega gta tgg etc ggg act 353 
Trp Gly Ala Gin lie Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr 
80 85 90 

tte aac gag caa gaa gaa get get egt tee tac gae ate gea get tgt 401 
Phe Asn Glu Gin Glu Glu Ala Ala Arg Ser Tyr Asp lie Ala Ala Cys 
95 100 105 

aga ttc cgt ggc cgc gac gcc gtc gtc aac ttc aag aac gtt ctg gaa 449 
Arg Phe Arg Gly Arg Asp Ala Val Val Asn Phe Lys Asn Val Leu Glu 
110 115 120 

gac ggc gat tta get ttt ctt gaa get cac tea aag gcc gag ate gtc 497 
Asp Gly Asp Leu Ala Phe Leu Glu Ala His Ser Lys Ala Glu lie Val 
125 130 135 

gae atg ttg aga aaa eac act tac gee gae gag ctt gaa cag aac aat 545 
Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Leu Glu Gin Asn Asn 
140 145 150 155 

aaa egg eag ttg ttt etc tee gtc gae get aae gga aaa egt aac gga 593 
Lys Arg Gin Leu Phe Leu Ser Val Asp Ala Asn Gly Lys Arg Asn Gly 
160 165 170 

teg agt act act caa aae gae aaa gtt tta aag aeg tgt gaa gtt ctt 641 
Ser Ser Thr Thr Gin Asn Asp Lys Val Leu Lys Thr Cys Glu Val Leu 

175 180 185 

tte gag aag get gtt aca cct age gac gtt ggg aag eta aae egt etc 689 
Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu 

190 195 200 

gtg ata cct aaa caa cac gcc gag aaa cae ttt eeg tta ceg tea eeg 73 7 

Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu Pro Ser Pro 
205 210 215 

tea eeg gea gtg act aaa gga gtt ttg ate aac ttc gaa gac gtt aac 785 
Ser Pro Ala Val Thr Lys Gly Val Leu lie Asn Phe Glu Asp Val Asn 
220 225 230 235 

ggt aaa gtg tgg agg ttc cgt tac tea tac tgg aac agt agt caa agt 833 
Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser 
240 245 250 

tac gtg ttg ace aag gga tgg agt cga ttc gtc aag gag aag aat ctt 881 
Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu 
255 260 265 

cga gcc ggt gat gtt gtt act ttc gag aga teg aec gga eta gag egg 929 
Arg Ala Gly Asp Val Val Thr Phe Glu Arg Ser Thr Gly Leu Glu Arg 
270 275 280 

cag tta tat att gat tgg aaa gtt egg tct ggt eeg aga gaa aae eeg 977 
Gin Leu Tyr lie Asp Trp Lys Val Arg Ser Gly Pro Arg Glu Asn Pro 
285 290 295 

gtt cag gtg gtg gtt egg ctt ttc gga gtt gat ate ttt aat gtg ace 1025 
Val Gin Val Val Val Arg Leu Phe Gly Val Asp lie Phe Asn Val Thr 

300 305 310 315 

ace gtg aag cca aac gac gtc gtg gcc gtt tgc ggt gga aag aga tct 10 73 

Thr Val Lys Pro Asn Asp Val Val Ala Val Cys Gly Gly Lys Arg Ser 
320 325 330 

cga gat gtt gat gat atg ttt gcg tta egg tgt tec aag aag eag gcg 1121 
Arg Asp Val Asp Asp Met Phe Ala Leu Arg Cys Ser Lys Lys Gin Ala 
335 340 345 
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ata ate aat get ttg tga catatttcct tttccgattt tatgctttcg 1169 
lie lie Asn Ala Leu 
350 

ttttttaatt tttttttttg tcaagttgtg taggttgtga ttcatgctag gttgtattta 1229 
ggaaaagaga taagacc 124 6 

<210> 6 

<211> 352 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 6 

Met Asp Ser Ser Cys lie Asp Glu lie Ser Ser Ser Thr Ser Glu Ser 
1 5 ' 10 15 

Phe Ser Ala Thr Thr Ala Lys Lys Leu Ser Pro Pro Pro Ala Ala Ala 
20 25 30 

Leu Arg Leu Tyr Arg Met Gly Ser Gly Gly Ser Ser Val Val Leu Asp 
35 40 45 

Pro Glu Asn Gly Leu Glu Thr Glu Ser Arg Lys Leu Pro Ser Ser Lys 
50 55 60 

Tyr Lys Gly Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin lie 
65 70 75 80 

Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Gin Glu 
85 90 95 

Glu Ala Ala Arg Ser Tyr Asp lie Ala Ala Cys Arg Phe Arg Gly Arg 
100 105 110 

Asp Ala Val Val Asn Phe Lys Asn Val Leu Glu Asp Gly Asp Leu Ala 
115 120 125 

Phe Leu Glu Ala His Ser Lys Ala Glu lie Val Asp Met Leu Arg Lys 
130 135 140 

His Thr Tyr Ala Asp Glu Leu Glu Gin Asn Asn Lys Arg Gin Leu Phe 
145 150 155 160 

Leu Ser Val Asp Ala Asn Gly Lys Arg Asn Gly Ser Ser Thr Thr Gin 
165 170 175 

Asn Asp Lys Val Leu Lys Thr Cys Glu Val Leu Phe Glu Lys Ala Val 
180 185 190 

Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val lie Pro Lys Gin 
195 200 205 

His Ala Glu Lys His Phe Pro Leu Pro Ser Pro Ser Pro Ala Val Thr 
210 215 220 

Lys Gly Val Leu lie Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg 
225 230 235 240 
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Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys 
245 250 255 

Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val 
260 265 270 

Val Thr Phe Glu Arg Ser Thr Gly Leu Glu Arg Gin Leu Tyr lie Asp 
275 280 285 

Trp Lys Val Arg Ser Gly Pro Arg Glu Asn Pro Val Gin Val Val Val 
290 295 300 

Arg Leu Phe Gly Val Asp lie Phe Asn Val Thr Thr Val Lys Pro Asn 
305 310 315 320 

Asp Val Val Ala Val Cys Gly Gly Lys Arg Ser Arg Asp Val Asp Asp 
325 330 335 

Met Phe Ala Leu Arg Cys Ser Lys Lys Gin Ala lie lie Asn Ala Leu 
340 345 350 



<210> 


7 


<211> 


1379 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(97) . . (1032) 


<223> 


G428 


<400> 


7 



ttacttttgt gtttcttcat attcttcaga agcaagcaca aggctaggga tcgaagaagc 60 

ggcgatcact gatcgtatct cactacgatc acatta atg gat aga atg tgt ggt 114 

Met Asp Arg Met Cys Gly 
1 5 

ttc cgc teg acg gaa gac tat teg gag aaa gcg acg ttg atg atg ccg 162 
Phe Arg Ser Thr Glu Asp Tyr Ser Glu Lys Ala Thr Leu Met Met Pro 
10 15 20 

tec gat tat cag tct ttg att tgt tea aec ace gga gac aat caa aga 210 
Ser Asp Tyr Gin Ser Leu lie Cys Ser Thr Thr Gly Asp Asn Gin Arg 
25 30 35 

ctg ttt gga tec gac gaa etc get acc get ttg tec teg gag ttg ctt 258 
Leu Phe Gly Ser Asp Glu Leu Ala Thr Ala Leu Ser Ser Glu Leu Leu 
40 45 50 

ccg cgt att cga aaa get gag gat aat tte tct ett agt gtc ate aaa 3 06 

Pro Arg lie Arg Lys Ala Glu Asp Asn Phe Ser Leu Ser Val lie Lys 
55 60 65 70 

tec aaa ate get tct cat cct ttg tat ect cgc tta etc caa acc tac 3 54 

Ser Lys lie Ala Ser His Pro Leu Tyr Pro Arg Leu Leu Gin Thr Tyr 
75 80 85 



ate gat tgc caa aag gtg gga gcg cct atg gaa ata gcg tgt ata ttg 
lie Asp Cys Gin Lys Val Gly Ala Pro Met Glu lie Ala Cys lie Leu 
90 95 100 



402 



gaa gag att cag cga gag aac cat gtg tac aag aga gat gtt get cea 450 
Glu Glu lie Gin Arg Glu Asn His Val Tyr Lys Arg Asp Val Ala Pro 
105 110 115 
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tta tct tgc ttt gga get gat cct gag ctt gat gaa ttc atg gaa acc 498 
Leu Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe Met Glu Thr 
120 125 130 

tac tgt gat ata ttg gtt aaa tac aaa acc gat ctt gcg agg ccg ttc 546 
Tyr Cys Asp lie Leu Val Lys Tyr Lys Thr Asp Leu Ala Arg Pro Phe 
135 140 145 150 

gac gag get aea aet ttc ata aac aag att gaa atg cag ett cag aae 594 
Asp Glu Ala Thr Thr Phe lie Asn Lys lie Glu Met Gin Leu Gin Asn 
155 160 165 

ttg tgc act ggt eea gcg tct get aca get ctt tea gat gat ggt gcg 642 
Leu Cys Thr Gly Pro Ala Ser Ala Thr Ala Leu Ser Asp Asp Gly Ala 

170 175 . 180 

gtt tea tct gac gag gaa ctg aga gaa gat gat gac ata gea gcg gat 690 
Val Ser Ser Asp Glu Glu Leu Arg Glu Asp Asp Asp lie Ala Ala Asp 
185 190 195 

gac age caa caa aga age aat gac cgc gat ctg aag gac cag eta eta 73 8 

Asp Ser Gin Gin Arg Ser Asn Asp Arg Asp Leu Lys Asp Gin Leu Leu 
200 205 210 

cgc aaa ttt ggt age cat ate agt tea ttg aaa etc gag ttc tct aaa 786 
Arg Lys Phe Gly Ser His lie Ser Ser Leu Lys Leu Glu Phe Ser Lys 
215 220 225 230 

aag aag aag aaa ggg aag eta eea aga gaa gea aga caa gcg ttg etc 834 
Lys Lys Lys Lys Gly Lys Leu Pro Arg Glu Ala Arg Gin Ala Leu Leu 
235 240 245 

gat tgg tgg aat gtt cat aat aaa tgg cct tac cct act gaa ggc gac 882 
Asp Trp Trp Asn Val His Asn Lys Trp Pro Tyr Pro Thr Glu Gly Asp 
250 255 260 

aaa ata get ctg get gaa gaa aca ggt ttg gat caa aaa caa ate aac 930 
Lys lie Ala Leu Ala Glu Glu Thr Gly Leu Asp Gin Lys Gin lie Asn 
265 270 275 

aat tgg ttt ata aae caa agg aaa cgc cat tgg aag cct teg gag aac 978 
Asn Trp Phe lie Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asn 
280 285 290 

atg ccg ttt gat atg atg gac gat tct aat gaa aea ttc ttt acc gag 1026 
Met Pro Phe Asp Met Met Asp Asp Ser Asn Glu Thr Phe Phe Thr Glu 
295 300 305 310 

gaa tga aaagagagae atgggattgt geattgtata atttttaeae tgttttccea 1082 
Glu 



agaaaagaaa 


acagtaaaaa 


gcttttggta 


aatgggacat 


catcgcgaat 


gaatggaacc 


1142 


agttagccaa 


aacggtcaag 


ggcgtggcgt 


aacgagacat 


tgtattggaa 


atagtggeaa 


1202 


tattatgtea 


ctaatettee 


aatggtccaa 


aatgatagat 


ttcttatttg 


tattgaaeet 


1262 


tacttagata 


gctgatgtgt 


caactaaata 


atttattttc 


atcettatac 


tacttgtatc 


1322 


aatgtetcta 


attgateaat 


tgttgcttge 


tattcaaaaa 


aaaaaaaaaa 


aaaaaaa 


1379 



<210> 8 
<211> 311 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Asp Arg Met Cys Gly Phe Arg Ser Thr Glu Asp Tyr Ser Glu Lys 
15 10 15 



Ala Thr Leu Met Met Pro Ser Asp Tyr Gin Ser Leu lie Cys Ser Thr 
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20 25 30 

Thr Gly Asp Asn Gin Arg Leu Phe Gly Ser Asp Glu Leu Ala Thr Ala 
35 40 45 

Leu Ser Ser Glu Leu Leu Pro Arg lie Arg Lys Ala Glu Asp Asn Phe 
50 55 60 

Ser Leu Ser Val lie Lys Ser Lys lie Ala Ser His Pro Leu Tyr Pro 
65 70 75 80 

Arg Leu Leu Gin Thr Tyr lie Asp Cys Gin Lys Val Gly Ala Pro Met 
85 90 95 

Glu lie Ala Cys lie Leu Glu Glu lie Gin Arg Glu Asn His Val Tyr 
100 105 110 

Lys Arg Asp Val Ala Pro Leu Ser Cys Phe Gly Ala Asp Pro Glu Leu 
115 120 125 

Asp Glu Phe Met Glu Thr Tyr Cys Asp lie Leu Val Lys Tyr Lys Thr 
130 135 140 

Asp Leu Ala Arg Pro Phe Asp Glu Ala Thr Thr Phe lie Asn Lys lie 
145 150 155 160 

Glu Met Gin Leu Gin Asn Leu Cys Thr Gly Pro Ala Ser Ala Thr Ala 
165 170 175 

Leu Ser Asp Asp Gly Ala Val Ser Ser Asp Glu Glu Leu Arg Glu Asp 
180 185 190 

Asp Asp lie Ala Ala Asp Asp Ser Gin Gin Arg Ser Asn Asp Arg Asp 
195 200 205 

Leu Lys Asp Gin Leu Leu Arg Lys Phe Gly Ser His He Ser Ser Leu 
210 215 220 

Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg Glu 
225 230 235 240 

Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Val His Asn Lys Trp Pro 
245 250 255 

Tyr Pro Thr Glu Gly Asp Lys He Ala Leu Ala Glu Glu Thr Gly Leu 
260 265 270 

Asp Gin Lys Gin He Asn Asn Trp Phe lie Asn Gin Arg Lys Arg His 
275 280 285 

Trp Lys Pro Ser Glu Asn Met Pro Phe Asp Met Met Asp Asp Ser Asn 
290 295 300 

Glu Thr Phe Phe Thr Glu Glu 
305 310 
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<210> 9 

<211> 1571 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (428) . . (1402) 

<223> G869 



<400> 9 
aggaacagtg 


aaaggttcgg 


ttttttgggt 


ttcgatctga 


taatcaacaa 


gaaaaaaggg 


60 


tttgatttat 


gtcggctggg 


tttgaatcga 


ctgtgatttt 


gtctttgatt 


catatctctt 


120 


ctccgatttc 


atcatcatct 


tccccatcat 


cgtcgtcttt 


gaaatcttgt 


cttctcaacg 


180 


ctcttcactt 


ctgctgtaat 


aagcagaggc 


ttgttctgga 


gactccttct 


ctttccatgc 


240 


gcttaagacc 


caaaaggact 


tgttctagtg 


ttgaagtctt 


tgggggtttt 


cacataaagc 


300 


agcaaaagtt 


ttcttttttc 


atagttcgct 


gagagttttg 


agttttgata 


ccaaaaaagt 


360 


tttgaccttt 


tagagtgatt 


ttttgttctt 


tctgttttct 


gggtattttt 


gaggagtggg 


420 


tttaaca atg gtt gcg att aga aag gaa cag tct 


ttg agt ggt gtt agt 


469 



Met Val Ala lie Arg Lys Glu Gin Ser Leu Ser Gly Val Ser 
1 5 10 

age gag att aag aag aga get aag aga aac act eta teg tec ett eet 517 
Ser Glu lie Lys Lys Arg Ala Lys Arg Asn Thr Leu Ser Ser Leu Pro 
15 20 25 30 

caa gaa aec eaa eet ttg agg aaa gtc cgt att att gtg aat gat eet 565 
Gin Glu Thr Gin Pro Leu Arg Lys Val Arg lie lie Val Asn Asp Pro 
35 40 45 

tat get act gat gat tee tct agt gat gag gaa gag ctt aag gtt cct 613 
Tyr Ala Thr Asp Asp Ser Ser Ser Asp Glu Glu Glu Leu Lys Val Pro 
50 55 60 

aag cca agg aaa atg aaa cgt ate gtt cgt gag att aac ttt cct tct 661 
Lys Pro Arg Lys Met Lys Arg lie Val Arg Glu lie Asn Phe Pro Ser 
65 70 75 

atg gaa gtt tct gaa cag cct tct gag agt tct tct cag gac agt act 709 
Met Glu Val Ser Glu Gin Pro Ser Glu Ser Ser Ser Gin Asp Ser Thr 

80 85 90 

aaa act gat ggc aag ata get gtg tea get tct eet get gtt eet agg 757 
Lys Thr Asp Gly Lys lie Ala Val Ser Ala Ser Pro Ala Val Pro Arg 
95 100 105 110 

aag aag cct gtt ggt gtt agg eaa agg aaa tgg ggg aaa tgg get get 805 
Lys Lys Pro Val Gly Val Arg Gin Arg Lys Trp Gly Lys Trp Ala Ala 
115 120 125 

gag att aga gat cct att aag aaa act agg act tgg ttg ggt act ttt 853 
Glu lie Arg Asp Pro lie Lys Lys Thr Arg Thr Trp Leu Gly Thr Phe 
130 135 140 

gat act ctt gaa gaa get get aaa get tat gat get aag aag ett gag 901 
Asp Thr Leu Glu Glu Ala Ala Lys Ala Tyr Asp Ala Lys Lys Leu Glu 
145 150 155 

ttt gat get att gtt get gga aat gtg tec act act aaa cgt gat gtt 949 
Phe Asp Ala lie Val Ala Gly Asn Val Ser Thr Thr Lys Arg Asp Val 
160 165 170 

tct tea tct gag act age eaa tgc tct cgt tct tea cct gtt gtt cct 997 
Ser Ser Ser Glu Thr Ser Gin Cys Ser Arg Ser Ser Pro Val Val Pro 
175 180 185 190 

gtt gag caa gat gac act tct gca tea get etc act tgt gtc aac aac 1045 
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val 


Glu 


Gin 


Asp Asp 

195 


Thr 


Ser 


Ala 


Ser 


Ala 

2 00 


Leu 


Thr 


Cys 


Val 


Asn 
2 0 5 


Asn 




cct 
Pro 


gat 

Asp 


gac 
Asp 


gtc 
Val 
210 


teg 
Ser 


ace 
Thr 


gtt 
Val 


get 
Ala 


cca 
Pro 
215 


act 
Thr 


get 
Ala 


cca 
Pro 


act 
Thr 


cca 
Pro 
220 


aat 
Asn 


gtt 

Val 


1093 


cct 
Pro 


get 
Ala 


ggt 
Gly 
225 


gga 
Gly 


aac 
Asn 


aag 
Lys 


gaa 
Glu 


aeg 
Thr 
230 


ttg 
Leu 


ttc 
Phe 


gat 

Asp 


ttc 
Phe 


gac 
Asp 


ttt 
Phe 


act 
Thr 


aat 
Asn 


1141 


eta 
Leu 


cag 
Gin 
240 


ate 
He 


cct 
Pro 


gat 
Asp 


ttt 
Phe 


ggt 
Gly 
245 


ttc 
Phe 


ttg 
Leu 


gca 
Ala 


gag 
Glu 


gag 
Glu 

o c n 


caa 
Gin 


caa 
Gin 


gac 
Asp 


eta 
Leu 


1189 


gac 
Asp 
255 


ttc 
Phe 


gat 
Asp 


tgt 
Cys 


ttc 
Phe 


etc 
Leu 
260 


gcg 
Ala 


gat 
Asp 


gat 
Asp 


cag 
Gin 


ttt 
Phe 
265 


gat 
Asp 


gat 
Asp 


ttc 
Phe 


ggc 
Gly 


ttg 
Leu 
270 


1237 


ctt 
Leu 


gat 
Asp 


gac 
Asp 


att 
He 


caa 
Gin 
275 


gga 
Gly 


ttc 
Phe 


gaa 
Glu 


gat 
Asp 


aac 
Asn 
280 


ggt 
Gly 


cca 
Pro 


agt 
Ser 


gcg 
Ala 


tta 
Leu 
285 


cca 
Pro 


1285 


gat 
Asp 


ttc 
Phe 


gac 
Asp 


ttt 
Phe 
290 


gcg 
Ala 


gat 
Asp 


gtt 
Val 


gaa 
Glu 


gat 
Asp 
295 


ett 
Leu 


eag 
Gin 


eta 
Leu 


get 
Ala 


gac 
Asp 
300 


tct 
Ser 


agt 
Ser 


1333 


ttc 
Phe 


ggt 
Gly 


ttc 
Phe 
305 


ctt 
Leu 


gat 
Asp 


caa 
Gin 


ctt 
Leu 


get 
Ala 
310 


cct 
Pro 


ate 
He 


aac 
Asn 


ate 
He 


tct 
Ser 

315 


tgc 
Cys 


cca 
Pro 


tta 
Leu 


1381 



aaa agt ttt gca get tea tag gatettgett agtaatgtta agtgagaaga 1432 
Lys Ser Phe Ala Ala Ser 

320 

gtgttttgtt ttttcgttta tgctttagta atttaagaca tacaaaagtg tgtgttccgg 1492 

attgtagtaa gatcttaaga cataaagecg ggttttgcaa ttaggaateg agttttaatg 1552 

aagttttagt ttatgtttg 1571 

<210> 10 
<211> 324 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 10 

Met Val Ala He Arg Lys Glu Gin Ser Leu Ser Gly Val Ser Ser Glu 
15 10 15 

He Lys Lys Arg Ala Lys Arg Asn Thr Leu Ser Ser Leu Pro Gin Glu 
20 25 30 

Thr Gin Pro Leu Arg Lys Val Arg He He Val Asn Asp Pro Tyr Ala 
35 40 45 

Thr Asp Asp Ser Ser Ser Asp Glu Glu Glu Leu Lys Val Pro Lys Pro 
50 55 60 

Arg Lys Met Lys Arg He Val Arg Glu He Asn Phe Pro Ser Met Glu 
65 70 75 80 

Val Ser Glu Gin Pro Ser Glu Ser Ser Ser Gin Asp Ser Thr Lys Thr 
85 90 95 

Asp Gly Lys He Ala Val Ser Ala Ser Pro Ala Val Pro Arg Lys Lys 
100 105 110 
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Pro Val Gly Val Arg Gin Arg Lys Trp Gly Lys Trp Ala Ala Glu lie 
115 120 125 

Arg Asp Pro lie Lys Lys Thr Arg Thr Trp Leu Gly Thr Phe Asp Thr 
130 135 140 

Leu Glu Glu Ala Ala Lys Ala Tyr Asp Ala Lys Lys Leu Glu Phe Asp 
145 150 155 160 

Ala lie Val Ala Gly Asri Val Ser Thr Thr Lys Arg Asp Val Ser Ser 
165 170 175 

Ser Glu Thr Ser Gin Cys Ser Arg Ser Ser Pro Val Val Pro Val Glu 
180 185 190 

Gin Asp Asp Thr Ser Ala Ser Ala Leu Thr Cys Val Asn Asn Pro Asp 
195 200 205 

Asp Val Ser Thr Val Ala Pro Thr Ala Pro Thr Pro Asn Val Pro Ala 
210 215 220 

Gly Gly Asn Lys Glu Thr Leu Phe Asp Phe Asp Phe Thr Asn Leu Gin 
225 230 235 240 

lie Pro Asp Phe Gly Phe Leu Ala Glu Glu Gin Gin Asp Leu Asp Phe 
245 250 255 

Asp Cys Phe Leu Ala Asp Asp Gin Phe Asp Asp Phe Gly Leu Leu Asp 
260 265 270 

Asp lie Gin Gly Phe Glu Asp Asn Gly Pro Ser Ala Leu Pro Asp Phe 
275 280 285 

Asp Phe Ala Asp Val Glu Asp Leu Gin Leu Ala Asp Ser Ser Phe Gly 
290 295 300 

Phe Leu Asp Gin Leu Ala Pro lie Asn lie Ser Cys Pro Leu Lys Ser 
305 310 315 320 

Phe Ala Ala Ser 



<210> 


11 


<211> 


1166 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(88) . . (951) 


<223> 


G1269 


<400> 


11 



aacaattctc tctctcttta ttcttcttct tcagcttcag atttcagatc ttaaatcttc 60 

aagtcttctt cttcttcttc tgcaacc atg get atg cag gaa cgt tgt gag agt 114 

Met Ala Met Gin Glu Arg Cys Glu Ser 
1 5 
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tta tgt tct gat gaa ctt ata tct tec tea gat gcc ttt tac etc aag 162 
Leu Cys Ser Asp Glu Leu He Ser Ser Ser Asp Ala Phe Tyr Leu Lys 
10 15 20 25 



aca aga aag ect tat ace ate act aaa eaa aga gag aaa tgg aea gaa 
Thr Arg Lys Pro Tyr Thr He Thr Lys Gin Arg Glu Lys Trp Thr Glu 
30 35 40 



atg gta tae get gaa eta aec gga tec aag ctg att cag gat gaa gat 
Met Val Tyr Ala Glu Leu Thr Gly Ser Lys Leu He Gin Asp Glu Asp 

125 130 135 



210 



gca gag eat gag aag ttt gta gaa gea ttg aaa etc tat ggc aga get 258 
Ala Glu His Glu Lys Phe Val Glu Ala Leu Lys Leu Tyr Gly Arg Ala 
45 50 55 

tgg aga cga ate gaa gaa eat gtt gga aea aaa act gca gtt cag att 306 
Trp Arg Arg He Glu Glu His Val Gly Thr Lys Thr Ala Val Gin He 
60 65 70 

ega age cat gcg cag aag ttc ttt act aag gtt get cge gat ttt ggt 354 
Arg Ser His Ala Gin Lys Phe Phe Thr Lys Val Ala Arg Asp Phe Gly 
75 80 85 

gtt age tct gag tee att gag ate ccg ect cca agg cca aag aga aag 402 
Val Ser Ser Glu Ser He Glu He Pro Pro Pro Arg Pro Lys Arg Lys 
90 95 100 105 

ccg atg cat eet tac ect aga aag ctt gtg att ect gat gca aaa gag 450 
Pro Met His Pro Tyr Pro Arg Lys Leu Val He Pro Asp Ala Lys Glu 
110 115 120 



498 



aac ega tct cca aca teg gtt tta tea get cat ggc tea gat gga tta 546 
Asn Arg Ser Pro Thr Ser Val Leu Ser Ala His Gly Ser Asp Gly Leu 

140 145 150 

ggt tec att ggt tea aat tea ect aac tct tct tea get gag tta tea 594 
Gly Ser He Gly Ser Asn Ser Pro Asn Ser Ser Ser Ala Glu Leu Ser 
155 160 165 

tct cac aca gag gaa tea ttg tct eta gaa gea gag aec aaa cag age 642 
Ser His Thr Glu Glu Ser Leu Ser Leu Glu Ala Glu Thr Lys Gin Ser 
170 175 180 185 

ctt aag etc ttt gga aaa act ttt gta gtt ggt gat tac aac tct tea 690 
Leu Lys Leu Phe Gly Lys Thr Phe Val Val Gly Asp Tyr Asn Ser Ser 
190 195 200 

atg agt tgt gat gat tct gaa gat ggc aag aag aag eta tac tea gaa 738 
Met Ser Cys Asp Asp Ser Glu Asp Gly Lys Lys Lys Leu Tyr Ser Glu 
205 210 215 

aca cag tct ctt caa tgt tct tct tct act tea gaa aac get gaa aca 786 
Thr Gin Ser Leu Gin Cys Ser Ser Ser Thr Ser Glu Asn Ala Glu Thr 
220 225 230 

gaa gtg gta gtg teg gag ttc aaa aga agt gag aga tea get ttc tct 834 
Glu Val Val Val Ser Glu Phe Lys Arg Ser Glu Arg Ser Ala Phe Ser 
235 240 245 

cag tta aaa teg teg gtg act gag atg aac aac atg aga ggg ttc atg 882 
Gin Leu Lys Ser Ser Val Thr Glu Met Asn Asn Met Arg Gly Phe Met 
250 255 260 265 

ect tae aaa aag aga gta aag gtg gaa gaa aac att gae aat gta aaa 930 
Pro Tyr Lys Lys Arg Val Lys Val Glu Glu Asn He Asp Asn Val Lys 
270 275 280 

tta tea tat ect ttg tgg tga agtgttcgtt tgtgtcaagt eagttgtgta 981 
Leu Ser Tyr Pro Leu Trp 
285 

aactcttttg atetcaacat cagattatgt gtataatgte agagtattag ggaaagtttt 1041 



Page 16 



wo 01/36444 



PCT/USOO/31325 



MBI0018 Sequence Listing. ST25 
tttggattag attcgtaaga tcactccaaa gtttcgtgtc tttccatata accagttaga 1101 

aattgagatc cttgtactta aacattttta tttgatcaat caaatcttct tgatgaaaaa 1161 

aaaaa 1166 

<210> 12 
<211> 287 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 12 

Met Ala Met Gin Glu Arg Cys Glu Ser Leu Cys Ser Asp Glu Leu lie 
15 10 15 

Ser Ser Ser Asp Ala Phe Tyr Leu Lys Thr Arg Lys Pro Tyr Thr lie 
20 25 30 

Thr Lys Gin Arg Glu Lys Trp Thr Glu Ala Glu His Glu Lys Phe Val 
35 40 45 

Glu Ala Leu Lys Leu Tyr Gly Arg Ala Trp Arg Arg lie Glu Glu His 
50 55 60 

Val Gly Thr Lys Thr Ala Val Gin lie Arg Ser His Ala Gin Lys Phe 
65 70 75 80 

Phe Thr Lys Val Ala Arg Asp Phe Gly Val Ser Ser Glu Ser lie Glu 
85 90 95 

lie Pro Pro Pro Arg Pro Lys Arg Lys Pro Met His Pro Tyr Pro Arg 
100 105 110 

Lys Leu Val lie Pro Asp Ala Lys Glu Met Val Tyr Ala Glu Leu Thr 
115 120 125 

Gly Ser Lys Leu lie Gin Asp Glu Asp Asn Arg Ser Pro Thr Ser Val 
130 135 140 

Leu Ser Ala His Gly Ser Asp Gly Leu Gly Ser He Gly Ser Asn Ser 
145 150 155 160 

Pro Asn Ser Ser Ser Ala Glu Leu Ser Ser His Thr Glu Glu Ser Leu 
165 170 175 

Ser Leu Glu Ala Glu Thr Lys Gin Ser Leu Lys Leu Phe Gly Lys Thr 
180 185 190 

Phe Val Val Gly Asp Tyr Asn Ser Ser Met Ser Cys Asp Asp Ser Glu 
195 200 205 

Asp Gly Lys Lys Lys Leu Tyr Ser Glu Thr Gin Ser Leu Gin Cys Ser 
210 215 220 

Ser Ser Thr Ser Glu Asn Ala Glu Thr Glu Val Val Val Ser Glu Phe 
225 230 235 240 



Lys Arg Ser Glu Arg Ser Ala Phe Ser Gin Leu Lys Ser Ser Val Thr 
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245 250 255 

Glu Met Asn Asn Met Arg Gly Phe Met Pro Tyr Lys Lys Arg Val Lys 
260 265 270 

Val Glu Glu Asn lie Asp Asn Val Lys Leu Ser Tyr Pro Leu Trp 

280 285 





275 


<210> 


13 


<211> 


2031 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(240) . . (1574) 


<223> 


G1038 


<400> 


13 



gctcgttttc aaattaaaaa cagggagaaa tttggaaatt ccagtacgac gggagataaa 60 

acctaacata cgccatggtg accgttatct aaactacgcc aaaatatttg aagtgtcgtc 120 

gtttcataat aaaacgcaaa caaaaaccca ctcccacttt ctcctttcca aaaaaagaac 180 

tctcgccact ttctctgctc ttttctttct ctctctcttt cttgttttcg ccggcgatc 239 

atg gag aaa age ggc ttc tct ccc gtc ggt eta agg gtt ctt gtc gta 287 
Met Glu Lys Ser Gly Phe Ser Pro Val Gly Leu Arg Val Leu Val Val 
15 10 15 

gac gat gat cca act tgg etc aag att etc gag aaa atg etc aag aag 335 
Asp Asp Asp Pro Thr Trp Leu Lys lie Leu Glu Lys Met Leu Lys Lys 
20 25 30 

tgt tct tac gaa gta aeg ace tgt gga tta get aga gag get ttg agg 383 
Cys Ser Tyr Glu Val Thr Thr Cys Gly Leu Ala Arg Glu Ala Leu Arg 
35 40 45 

ttg ctg agg gag cgt aaa gat gga tat gat ate gtg ate age gat gtg 431 
Leu Leu Arg Glu Arg Lys Asp Gly Tyr Asp lie Val lie Ser Asp Val 
50 55 60 

aac atg cct gac atg gat ggt ttc aag ctt ctt gag cat gtt ggt ctt 479 
Asn Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu 
65 70 75 80 

gaa tta gac etc cct gta ata atg atg teg gtg gac ggc gaa aca age 527 
Glu Leu Asp Leu Pro Val lie Met Met Ser Val Asp Gly Glu Thr Ser 
85 90 95 

ega gtg atg aag gga gtg eae aeg gga get tgt gat tac etc ttg aag 575 
Arg Val Met Lys Gly Val His Thr Gly Ala Cys Asp Tyr Leu Leu Lys 
100 105 110 

eeg ata aga atg aag gag tta aag att ata tgg eaa cat gtt etg aga 623 
Pro lie Arg Met Lys Glu Leu Lys lie lie Trp Gin His Val Leu Arg 

115 120 125 

aag aag ctt eaa gaa gtg aga gat ate gaa ggc tgt gga tac gaa gga 671 
Lys Lys Leu Gin Glu Val Arg Asp lie Glu Gly Cys Gly Tyr Glu Gly 
130 135 140 

gga geg gat tgg ate act ega tac gat gaa gca cat ttt ctt gga ggt 719 
Gly Ala Asp Trp lie Thr Arg Tyr Asp Glu Ala His Phe Leu Gly Gly 
145 150 155 160 

ggt gaa gat gtt tet ttt ggg aaa aag aga aaa gac ttt gac ttt gag 767 
Gly Glu Asp Val Ser Phe Gly Lys Lys Arg Lys Asp Phe Asp Phe Glu 
165 170 175 
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aag aag ctt ctt caa gat gag agt gat cca tea tct tct tct tec aag 815 
Lys Lys Leu Leu Gin Asp Glu Ser Asp Pro Ser Ser Ser Ser Ser Lys 
180 185 190 

aaa get aga gtt gtt tgg tct ttt gag ctt cat cat aag ttt gtc aac 863 
Lys Ala Arg Val Val Trp Ser Phe Glu Leu His His Lys Phe Val Asn 
195 200 205 

gcc gtt aac caa ate gga tgc gat cac aaa get ggt ccc aag aag ata 911 
Ala Val Asn Gin lie Gly Cys Asp His Lys Ala Gly Pro Lys Lys lie 
210 215 220 

ttg gat etc atg aat gtt cca tgg etc act aga gaa aat gtt gea age 959 
Leu Asp Leu Met Asn Val Pro Trp Leu Thr Arg Glu Asn Val Ala Ser 

225 230 235 240 

cac ctt cag aaa tat aga ctt tac ctg age aga tta gag aaa gga aag 1007 
His Leu Gin Lys Tyr Arg Leu Tyr Leu Ser Arg Leu Glu Lys Gly Lys 
245 250 255 

gag etc aag tgt tat tea ggt ggc gtg aag aat gcg gat tea tct cca 1055 
Glu Leu Lys Cys Tyr Ser Gly Gly Val Lys Asn Ala Asp Ser Ser Pro 
260 265 270 

aaa gat gtc gaa gtg aat tea ggc tac caa age cet ggg agg age age 1103 
Lys Asp Val Glu Val Asn Ser Gly Tyr Gin Ser Pro Gly Arg Ser Ser 
275 280 285 

tat gta ttc tct gga gga aat tct ctg ate caa aaa gea aca gag att 1151 
Tyr Val Phe Ser Gly Gly Asn Ser Leu lie Gin Lys Ala Thr Glu lie 
290 295 300 

gat cca aag cca ctt get tea get tct ttg tct gac ccc aac ace gat 1199 
Asp Pro Lys Pro Leu Ala Ser Ala Ser Leu Ser Asp Pro Asn Thr Asp 
305 310 315 320 

gtg ate atg cet ccg aaa aca aaa aag acg cgt ata gga ttt gat ect 1247 
Val lie Met Pro Pro Lys Thr Lys Lys Thr Arg lie Gly Phe Asp Pro 
325 330 335 

CCC att tec tec tct gcg ttt gac tct ctg ctt cet tgg aat gat gtt 1295 
Pro lie Ser Ser Ser Ala Phe Asp Ser Leu Leu Pro Trp Asn Asp Val 
340 345 350 

cca gag gtc ctt gaa teg aag ccg gtt ctg tat gag aat age ttt etc' 1343 
Pro Glu Val Leu Glu Ser Lys Pro Val Leu Tyr Glu Asn Ser Phe Leu 

355 360 365 

cag caa caa cca ttg cca agt caa agt tec tat gtt gea att tct gea 1391 
Gin Gin Gin Pro Leu Pro Ser Gin Ser Ser Tyr Val Ala lie Ser Ala 

370 375 380 

cca tct etc atg gag gag gaa atg aag cet ect tat gag aca cca gea 1439 
Pro Ser Leu Met Glu Glu Glu Met Lys Pro Pro Tyr Glu Thr Pro Ala 
385 390 395 400 

gga ggc agt agt gtg aat gea gat gag ttt etc atg cca caa gac aag 1487 
Gly Gly Ser Ser Val Asn Ala Asp Glu Phe Leu Met Pro Gin Asp Lys 
405 410 415 

ate ect act gta ace ett caa gat ttg gat ccc tct gee atg aag ctg 1535 
lie Pro Thr Val Thr Leu Gin Asp Leu Asp Pro Ser Ala Met Lys Leu 
420 425 430 

cag gag ttc aac aca gaa ggc gat tct gaa gaa get tga actggggaac 1584 
Gin Glu Phe Asn Thr Glu Gly Asp Ser Glu Glu Ala 
435 440 

ttceagaate acateattet gtttctttag acaetgaett agaettgact tggcttcaag 1644 

gegagcgttt cttgeaaaca ecgactecag ttteaagata cagtagtagc ceatcaetce 1704 

tatctgagct cccagcecac cttaattggt atggaaatga geggctgect gaeeetgaeg 1764 

agtattcctt catggtagac caaggtttat teatatctta acettgttce aataacttet 1824 

Page 19 



wo 01/36444 



PCT/USOO/31325 







MBI0018 Sequence 


Listing . ST25 




tttcgtatat 


tggttggtgt 


aatgcagaaa 


gattttgtgg 


gtatacctga 


aaataatctt 


1884 


gctttcccaa 


gaaccttcca 


tgatcggatg 


cattgtacaa 


taatccacga 


gtgtcgtagg 


1944 


ctaattacac 


caaacaggtt 


gatgacagtg 


ataaggccac 


atgtttcaca 


ccgtcgctta 


2004 


agatctttac 


tgtcacctgg 


aaggaaa 








2031 



<210> 14 
<211> 444 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

Met Glu Lys Ser Gly Phe Ser Pro Val Gly Leu Arg Val Leu Val Val 
15 10 15 

Asp Asp Asp Pro Thr Trp Leu Lys He Leu Glu Lys Met Leu Lys Lys 
20 25 30 

Cys Ser Tyr Glu Val Thr Thr Cys Gly Leu Ala Arg Glu Ala Leu Arg 
35 40 45 

Leu Leu Arg Glu Arg Lys Asp Gly Tyr Asp He Val He Ser Asp Val 
50 55 60 

Asn Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu 
65 70 75 80 

Glu Leu Asp Leu Pro Val He Met Met Ser Val Asp Gly Glu Thr Ser 
85 90 95 

Arg Val Met Lys Gly Val His Thr Gly Ala Cys Asp Tyr Leu Leu Lys 
100 105 110 

Pro He Arg Met Lys Glu Leu Lys He He Trp Gin His Val Leu Arg 
115 120 125 

Lys Lys Leu Gin Glu Val Arg Asp He Glu Gly Cys Gly Tyr Glu Gly 
130 135 140 

Gly Ala Asp Trp He Thr Arg Tyr Asp Glu Ala His Phe Leu Gly Gly 
145 150 155 160 

Gly Glu Asp Val Ser Phe Gly Lys Lys Arg Lys Asp Phe Asp Phe Glu 
165 170 175 

Lys Lys Leu Leu Gin Asp Glu Ser Asp Pro Ser Ser Ser Ser Ser Lys 
180 185 190 

Lys Ala Arg Val Val Trp Ser Phe Glu Leu His His Lys Phe Val Asn 
195 200 205 

Ala Val Asn Gin He Gly Cys Asp His Lys Ala Gly Pro Lys Lys He 
210 215 220 

Leu Asp Leu Met Asn Val Pro Trp Leu Thr Arg Glu Asn Val Ala Ser 
225 230 235 240 
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His Leu Gin Lys Tyr Arg Leu Tyr Leu Ser Arg Leu Glu Lys Gly Lys 
245 250 255 

Glu Leu Lys Cys Tyr Ser Gly Gly Val Lys Asn Ala Asp Ser Ser Pro 
260 265 270 

Lys Asp Val Glu Val Asn Ser Gly Tyr Gin Ser Pro Gly Arg Ser Ser 
275 280 285 

Tyr Val Phe Ser Gly Gly Asn Ser Leu lie Gin Lys Ala Thr Glu lie 
290 295 300 

Asp Pro Lys Pro Leu Ala Ser Ala Ser Leu Ser Asp Pro Asn Thr Asp 
305 310 315 320 

Val lie Met Pro Pro Lys Thr Lys Lys Thr Arg lie Gly Phe Asp Pro 
325 330 335 

Pro lie Ser Ser Ser Ala Phe Asp Ser Leu Leu Pro Trp Asn Asp Val 
340 345 350 

Pro Glu Val Leu Glu Ser Lys Pro Val Leu Tyr Glu Asn Ser Phe Leu 
355 360 365 

Gin Gin Gin Pro Leu Pro Ser Gin Ser Ser Tyr Val Ala lie Ser Ala 
370 375 380 

Pro Ser Leu Met Glu Glu Glu Met Lys Pro Pro Tyr Glu Thr Pro Ala 
385 390 395 400 

Gly Gly Ser Ser Val Asn Ala Asp Glu Phe Leu Met Pro Gin Asp Lys 
405 410 415 

lie Pro Thr Val Thr Leu Gin Asp Leu Asp Pro Ser Ala Met Lys Leu 
420 425 430 

Gin Glu Phe Asn Thr Glu Gly Asp Ser Glu Glu Ala 
435 440 



<210> 


15 


<211> 


2821 


<212> 


DNA 


<213> 


Arabidopsis thai i ana 


<220> 




<221> 


CDS 


<222> 


(188) . . (2716) 


<223> 


G438 


<400> 


15 



cggggtaccc aagccacgac cgtagaatct tcttttgtct gaaaagaatt acaatttacg 60 

tttctcttac gatacgacgg actttccgaa gaaattaatt taaagagaaa agaagaagaa 120 

gccaaagaag aagaagaagc tagaagaaac agtaaagttt gagacttttt ttgagggtcg 180 

agctaaa atg gag atg gcg gtg get aac cac cgt gag aga age agt gac 229 
Met Glu Met Ala Val Ala Asn His Arg Glu Arg Ser Ser Asp 
15 10 
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agt atg aat aga cat tta gat agt age ggt aag tac gtt agg tac aca 2 77 

Ser Met Asn Arg His Leu Asp Ser Ser Gly Lys Tyr Val Arg Tyr Thr 
15 20 25 30 

get gag caa gtc gag get ett gag cgt gtc tac get gag tgt cet aag 325 
Ala Glu Gin Val Glu Ala Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys 
35 40 45 

cct age tct etc cgt ega caa caa ttg ate cgt gaa tgt tec att ttg 373 
Pro Ser Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Ser lie Leu 
50 55 60 

gee aat att gag cct aag eag ate aaa gtc tgg ttt eag aac egc agg 421 
Ala Asn lie Glu Pro Lys Gin lie Lys Val Trp Phe Gin Asn Arg Arg 
65 70 75 

tgt ega gat aag eag agg aaa gag gcg teg agg etc eag age gta aac 469 
Cys Arg Asp Lys Gin Arg Lys Glu Ala Ser Arg Leu Gin Ser Val Asn 
80 85 90 

egg aag etc tct gcg atg aat aaa etg ttg atg gag gag aat gat agg 517 
Arg Lys Leu Ser Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg 
95 100 105 110 

ttg eag aag eag gtt tct eag ctt gtc tgc gaa aat gga tat atg aaa 565 
Leu Gin Lys Gin Val Ser Gin Leu Val Cys Glu Asn Gly Tyr Met Lys 
115 120 125 

eag eag eta act act gtt gtt aac gat cea age tgt gaa tct gtg gtc 613 
Gin Gin Leu Thr Thr Val Val Asn Asp Pro Ser Cys Glu Ser Val Val 

130 135 140 

aca act cct eag eat teg ett aga gat gcg aat agt cct get gga ttg 661 
Thr Thr Pro Gin His Ser Leu Arg Asp Ala Asn Ser Pro Ala Gly Leu 

145 150 155 

etc tea ate gea gag gag act ttg gea gag tte eta tec aag get aca 709 
Leu Ser lie Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr 
160 165 170 

gga act get gtt gat tgg gtt eag atg cct ggg atg aag cct ggt ecg 757 
Gly Thr Ala Val Asp Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro 
175 180 185 190 

gat teg gtt gge ate ttt gee att teg caa aga tgc aat gga gtg gea 805 
Asp Ser Val Gly lie Phe Ala lie Ser Gin Arg Cys Asn Gly Val Ala 
195 200 205 

get ega gcc tgt ggt ctt gtt age tta gaa cct atg aag att gea gag 853 
Ala Arg Ala Cys Gly Leu Val Ser Leu Glu Pro Met Lys lie Ala Glu 
210 215 220 

ate etc aaa gat egg cca tct tgg tte cgt gae tgt agg age ctt gaa 901 
lie Leu Lys Asp Arg Pro Ser Trp Phe Arg Asp Cys Arg Ser Leu Glu 
225 230 235 

gtt tte act atg tte ceg get ggt aat ggt ggc aca ate gag ett gtt 949 
Val Phe Thr Met Phe Pro Ala Gly Asn Gly Gly Thr lie Glu Leu Val 
240 245 250 

tat atg cag acg tat gea cca acg act etg get cet gcc egc gat tte 997 
Tyr Met Gin Thr Tyr Ala Pro Thr Thr Leu Ala Pro Ala Arg Asp Phe 

255 260 265 270 

tgg ace etg aga tac aca acg age etc gae aat ggg agt ttt gtg gtt 1045 
Trp Thr Leu Arg Tyr Thr Thr Ser Leu Asp Asn Gly Ser Phe Val Val 
275 280 285 

tgt gag agg teg eta tct gge tct gga get ggg cct aat get get tea 1093 
Cys Glu Arg Ser Leu Ser Gly Ser Gly Ala Gly Pro Asn Ala Ala Ser 
290 295 300 

get tct eag ttt gtg aga gea gaa atg ett tct agt ggg tat tta ata 1141 
Ala Ser Gin Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu lie 
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305 310 315 

agg cct tgt gat ggt ggt ggt tct att att cac att gtc gat cac ctt 
Arg Pro Cys Asp Gly Gly Gly Ser lie lie His lie Val Asp His Leu 
320 325 330 

aat ctt gag get tgg agt gtt ccg gat gtg ctt cga ccc ctt tat gag 
Asn Leu Glu Ala Trp Ser Val Pro Asp Val Leu Arg Pro Leu Tyr Glu 
335 340 345 350 

tea tec aaa gtc gtt gca caa aaa atg acc att tec geg ttg egg tat 
Ser Ser Lys Val Val Ala Gin Lys Met Thr lie Ser Ala Leu Arg Tyr 
355 360 365 

ate agg caa tta gee caa gag tct aat ggt gaa gta gtg tat gga tta 
lie Arg Gin Leu Ala Gin Glu Ser Asn Gly Glu Val Val Tyr Gly Leu 

370 375 380 

gga agg cag cct get gtt ctt aga acc ttt age caa aga tta age agg 
Gly Arg Gin Pro Ala Val Leu Arg Thr Phe Ser Gin Arg Leu Ser Arg 

385 390 395 

ggc ttc aat gat gcg gtt aat ggg ttt ggt gac gac ggg tgg tct acg 
Gly Phe Asn Asp Ala Val Asn Gly Phe Gly Asp Asp Gly Trp Ser Thr 
400 405 410 

atg eat tgt gat gga gcg gaa gat att ate gtt get att aac tct aca 
Met His Cys Asp Gly Ala Glu Asp lie lie Val Ala lie Asn Ser Thr 
415 420 425 430 

aag cat ttg aat aat att tct aat tct ctt teg ttc ctt gga ggc gtg 
Lys His Leu Asn Asn lie Ser Asn Ser Leu Ser Phe Leu Gly Gly Val 
435 440 445 

etc tgt gcc aag get tea atg ctt etc caa aat gtt cct cct gcg gtt 
Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 
450 455 460 

ttg ate egg ttc ctt aga gag cat cga tct gag tgg get gat ttc aat 
Leu lie Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Phe Asn 
465 470 475 

gtt gat gca tat tec get get aca ctt aaa get ggt age ttt get tat 
Val Asp Ala Tyr Ser Ala Ala Thr Leu Lys Ala Gly Ser Phe Ala Tyr 
480 485 490 

ccg gga atg aga eca aca aga ttc act ggg agt cag ate ata atg cea 
Pro Gly Met Arg Pro Thr Arg Phe Thr Gly Ser Gin lie lie Met Pro 

495 500 505 510 

eta gga eat aca att gaa cac gaa gaa atg eta gaa gtt gtt aga etg 
Leu Gly His Thr lie Glu His Glu Glu Met Leu Glu Val Val Arg Leu 
515 520 525 

gaa ggt cat tct ctt get caa gaa gat gca ttt atg tea egg gat gtc 
Glu Gly His Ser Leu Ala Gin Glu Asp Ala Phe Met Ser Arg Asp Val 
530 535 540 

eat etc ctt cag att tgt acc ggg att gac gag aat gee gtt gga get 
His Leu Leu Gin lie Cys Thr Gly lie Asp Glu Asn Ala Val Gly Ala 
545 550 555 

tgt tct gaa etg ata ttt get ccg att aat gag atg ttc ccg gat gat 
Cys Ser Glu Leu lie Phe Ala Pro lie Asn Glu Met Phe Pro Asp Asp 
560 565 570 

get eca ctt gtt ccc tct gga ttc cga gtc ata ccc gtt gat get aaa 
Ala Pro Leu Val Pro Ser Gly Phe Arg Val lie Pro Val Asp Ala Lys 
575 580 585 590 

scg gga gat gta caa gat etg tta acc get aat cac cgt aca eta gac 
Thr Gly Asp Val Gin Asp Leu Leu Thr Ala Asn His Arg Thr Leu Asp 
595 600 605 

tta act tct age ctt gaa gtc ggt eca tea cct gag aat get tct gga 
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Leu Thr Ser Ser Leu Glu Val Gly Pro Ser Pro Glu Asn Ala Ser Gly 
610 615 620 

aac tct ttt tct age tea age teg aga tgt att etc aet ate gcg ttt 2101 
Asn Ser Phe Ser Ser Ser Ser Ser Arg Cys lie Leu Thr lie Ala Phe 
625 630 635 

caa ttc ect ttt gaa aac aac ttg caa gaa aat gtt get ggt atg get 2149 
Gin Phe Pro Phe Glu Asn Asn Leu Gin Glu Asn Val Ala Gly Met Ala 
640 645 650 

tgt cag tat gtg agg age gtg ate tea tea gtt caa cgt gtt gea atg 2197 
Cys Gin Tyr Val Arg Ser Val lie Ser Ser Val Gin Arg Val Ala Met 
655 660 665 670 

geg ate tea ceg tet ggg ata age eeg agt ctg gge tee aaa ttg tec 2245 
Ala lie Ser Pro Ser Gly lie Ser Pro Ser Leu Gly Ser Lys Leu Ser 
675 680 685 

cca gga tet ect gaa get gtt aet ett get eag tgg ate tet caa agt 2293 
Pro Gly Ser Pro Glu Ala Val Thr Leu Ala Gin Trp lie Ser Gin Ser 
690 695 700 

tac agt cat cac tta ggc teg gag ttg ctg acg att gat tea ctt gga 2341 
Tyr Ser His His Leu Gly Ser Glu Leu Leu Thr lie Asp Ser Leu Gly 
705 710 715 

age gac gac teg gta eta aaa ctt eta tgg gat cac caa gat gee ate 23 89 

Ser Asp Asp Ser Val Leu Lys Leu Leu Trp Asp His Gin Asp Ala lie 
720 725 730 

Ctg tgt tge tea tta aag eea cag cca gtg tte atg ttt gcg aac eaa 243 7 

Leu Cys Cys Ser Leu Lys Pro Gin Pro Val Phe Met Phe Ala Asn Gin 
735 740 745 750 

get ggt eta gae atg eta gag aca aea ett gta gee tta eaa gat ata 2485 
Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp lie 
755 760 765 

aea etc gaa aag ata tte gat gaa teg ggt cgt aag get ate tgt teg 2533 
Thr Leu Glu Lys lie Phe Asp Glu Ser Gly Arg Lys Ala He Cys Ser 
770 775 780 

gac ttc gee aag eta atg caa cag gga ttt get tge ttg ect tea gga 2581 
Asp Phe Ala Lys Leu Met Gin Gin Gly Phe Ala Cys Leu Pro Ser Gly 
785 790 795 

ate tgt gtg tea aeg atg gga aga eat gtg agt tat gaa eaa get gtt 2629 
He Cys Val Ser Thr Met Gly Arg His Val Ser Tyr Glu Gin Ala Val 
800 805 810 

get tgg aaa gtg ttt get gea tct gaa gaa aae aae aac aat ctg eat 2677 
Ala Trp Lys Val Phe Ala Ala Ser Glu Glu Asn Asn Asn Asn Leu His 
815 820 825 830 

tgt ctt gee ttc tec ttt gta aac tgg tct ttt gtg tga ttcgattgae 2726 
Cys Leu Ala Phe Ser Phe Val Asn Trp Ser Phe Val 
835 840 

agaaaaagac taatttaaat ttacgttaga gaactcaaat ttttggttgt tgtttaggtg 2786 

tctctgtttt gttttttaaa attattttga teaaa 2821 

<210> 16 
<211> 842 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

Met Glu Met Ala Val Ala Asn His Arg Glu Arg Ser Ser Asp Ser Met 
15 10 15 
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Asn Arg His Leu Asp Ser Ser Gly Lys Tyr Val Arg Tyr Thr Ala Glu 
20 25 30 

Gin Val Glu Ala Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser 
35 40 45 

Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Ser lie Leu Ala Asn 
50 55 ' 60 

lie Glu Pro Lys Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 75 80 

Asp Lys Gin Arg Lys Glu Ala Ser Arg Leu Gin Ser Val Asn Arg Lys 
85 90 95 

Leu Ser Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

Lys Gin Val Ser Gin Leu Val Cys Glu Asn Gly Tyr Met Lys Gin Gin 
115 120 125 

Leu Thr Thr Val Val Asn Asp Pro Ser Cys Glu Ser Val Val Thr Thr 
130 135 140 

Pro Gin His Ser Leu Arg Asp Ala Asn Ser Pro Ala Gly Leu Leu Ser 
145 150 155 160 

lie Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 
165 170 175 

Ala Val Asp Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 190 

Val Gly lie Phe Ala lie Ser Gin Arg Cys Asn Gly Val Ala Ala Arg 
195 200 205 

Ala Cys Gly Leu Val Ser Leu Glu Pro Met Lys lie Ala Glu lie Leu 
210 215 220 

Lys Asp Arg Pro Ser Trp Phe Arg Asp Cys Arg Ser Leu Glu Val Phe 
225 230 235 240 

Thr Met Phe Pro Ala Gly Asn Gly Gly Thr lie Glu Leu Val Tyr Met 
245 250 255 

Gin Thr Tyr Ala Pro Thr Thr Leu Ala Pro Ala Arg Asp Phe Trp Thr 
260 265 270 

Leu Arg Tyr Thr Thr Ser Leu Asp Asn Gly Ser Phe Val Val Cys Glu 
275 280 285 

Arg Ser Leu Ser Gly Ser Gly Ala Gly Pro Asn Ala Ala Ser Ala Ser 
290 295 300 

Gin Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu lie Arg Pro 
305 310 315 320 
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Cys Asp Gly Gly Gly Ser lie He His He Val Asp His Leu Asn Leu 
325 330 335 

Glu Ala Trp Ser Val Pro Asp Val Leu Arg Pro Leu Tyr Glu Ser Ser 
340 345 350 

Lys Val Val Ala Gin Lys Met Thr He Ser Ala Leu Arg Tyr He Arg 
355 360 365 

Gin Leu Ala Gin Glu Ser Asn Gly Glu Val Val Tyr Gly Leu Gly Arg 
370 375 380 

Gin Pro Ala Val Leu Arg Thr Phe Ser Gin Arg Leu Ser Arg Gly Phe 
385 390 395 400 

Asn Asp Ala Val Asn Gly Phe Gly Asp Asp Gly Trp Ser Thr Met His 
405 410 415 

Cys Asp Gly Ala Glu Asp He He Val Ala He Asn Ser Thr Lys His 
420 425 430 

Leu Asn Asn He Ser Asn Ser Leu Ser Phe Leu Gly Gly Val Leu Cys 
435 440 445 

Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val Leu He 
450 455 460 

Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Phe Asn Val Asp 
465 470 475 480 

Ala Tyr Ser Ala Ala Thr Leu Lys Ala Gly Ser Phe Ala Tyr Pro Gly 
485 490 495 

Met Arg Pro Thr Arg Phe Thr Gly Ser Gin He He Met Pro Leu Gly 
500 505 510 

His Thr He Glu His Glu Glu Met Leu Glu Val Val Arg Leu Glu Gly 
515 520 525 

His Ser Leu Ala Gin Glu Asp Ala Phe Met Ser Arg Asp Val His Leu 
530 535 540 

Leu Gin He Cys Thr Gly He Asp Glu Asn Ala Val Gly Ala Cys Ser 
545 550 555 560 

Glu Leu He Phe Ala Pro He Asn Glu Met Phe Pro Asp Asp Ala Pro 
565 570 575 

Leu Val Pro Ser Gly Phe Arg Val He Pro Val Asp Ala Lys Thr Gly 
580 585 590 

Asp Val Gin Asp Leu Leu Thr Ala Asn His Arg Thr Leu Asp Leu Thr 
595 600 605 

Ser Ser Leu Glu Val Gly Pro Ser Pro Glu Asn Ala Ser Gly Asn Ser 
610 615 620 
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Phe Ser Ser Ser Ser Ser Arg Cys He Leu Thr He Ala Phe Gin Phe 
625 630 635 640 

Pro Phe Glu Asn Asn Leu Gin Glu Asn Val Ala Gly Met Ala Cys Gin 
645 650 655 

Tyr Val Arg Ser Val He Ser Ser Val Gin Arg Val Ala Met Ala He 
660 665 670 

Ser Pro Ser Gly He Ser Pro Ser Leu Gly Ser Lys Leu Ser Pro Gly 
675 680 685 

Ser Pro Glu Ala Val Thr Leu Ala Gin Trp He Ser Gin Ser Tyr Ser 
690 695 700 

His His Leu Gly Ser Glu Leu Leu Thr He Asp Ser Leu Gly Ser Asp 
705 710 715 720 

Asp Ser Val Leu Lys Leu Leu Trp Asp His Gin Asp Ala He Leu Cys 
725 730 735 

Cys Ser Leu Lys Pro Gin Pro Val Phe Met Phe Ala Asn Gin Ala Gly 
740 745 750 

Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Thr Leu 
755 760 765 

Glu Lys He Phe Asp Glu Ser Gly Arg Lys Ala He Cys Ser Asp Phe 
770 775 780 

Ala Lys Leu Met Gin Gin Gly Phe Ala Cys Leu Pro Ser Gly He Cys 
785 790 795 800 

Val Ser Thr Met Gly Arg His Val Ser Tyr Glu Gin Ala Val Ala Trp 
805 810 815 

Lys Val Phe Ala Ala Ser Glu Glu Asn Asn Asn Asn Leu His Cys Leu 
820 825 830 

Ala Phe Ser Phe Val Asn Trp Ser Phe Val 

840 





835 


<210> 


17 


<211> 


1888 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(326) . . (1708) 


<223> 


G571 


<400> 


17 



tagccgacct ctcttctctc ttctgaaaaa aacaccaaag gagctttaaa tgctccgtta 60 

cataatctct atctctttcc aagaatatag agaaaggaaa ataatataca agaattaaaa 12 0 

gaaggtatat catcatctct ctagctagtg atcaaagcac cgtcatcatc atcatatatc 180 
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atcagcttgc ctcagaggag aagaccaaca taagagagat cgaagatcaa aatctatctc 240 

tcttcatcat cttctgctgt tactatcata tcacacgctc tctcaaacat catcctatat 300 

atagacttct cttcatcatc atcaa atg caa ggt cat cac cag aat cat cat 352 

Met Gin Gly His His Gin Asn His His 
1 5 

caa cac tta tea tea tec tec gee aeg tct tec cat gga aac ttc atg 400 

Gin His Leu Ser Ser Ser Ser Ala Thr Ser Ser His Gly Asn Phe Met 

10 15 20 25 

aac aaa gat ggg tat gat att gga gag ata gac cea tea etc ttc etc 448 
Asn Lys Asp Gly Tyr Asp lie Gly Glu lie Asp Pro Ser Leu Phe Leu 
30 35 40 

tat ctt gat gga caa gga cat cat gat cct cca tea act get cct tct 496 
Tyr Leu Asp Gly Gin Gly His His Asp Pro Pro Ser Thr Ala Pro Ser 
45 50 55 

cct tta cat cat cat cac aca act cag aat ttg gcg atg aga cct cca 544 
Pro Leu His His His His Thr Thr Gin Asn Leu Ala Met Arg Pro Pro 
60 65 70 

aca teg aeg etc aac ate ttt cca tet cag cct atg cac ata gag cca 592 
Thr Ser Thr Leu Asn lie Phe Pro Ser Gin Pro Met His lie Glu Pro 
75 80 85 

cct cct tct tct aca cac aat ace gat aat aca aga tta gtt eeg get 640 
Pro Pro Ser Ser Thr His Asn Thr Asp Asn Thr Arg Leu Val Pro Ala 
90 95 100 105 

get caa cct agt ggt tec act ega cca get tct gac eeg tec atg gac 688 
Ala Gin Pro Ser Gly Ser Thr Arg Pro Ala Ser Asp Pro Ser Met Asp 
110 115 120 

ttg ace aat cat tct cag ttt cat caa cct cct caa ggt tct aaa tec 736 
Leu Thr Asn His Ser Gin Phe His Gin Pro Pro Gin Gly Ser Lys Ser 
125 130 135 

ate aag aag gaa ggg aac ege aag ggt ctt gee tea teg gac cat gac 784 
lie Lys Lys Glu Gly Asn Arg Lys Gly Leu Ala Ser Ser Asp His Asp 
140 145 150 

ata cct aaa teg tea gac cct aaa aca ttg aga aga eta gca caa aac 832 
lie Pro Lys Ser Ser Asp Pro Lys Thr Leu Arg Arg Leu Ala Gin Asn 
155 160 165 

aga gaa gca gca aga aaa age aga tta cgt aaa aag get tat gtt cag 880 
Arg Glu Ala Ala Arg Lys Ser Arg Leu Arg Lys Lys Ala Tyr Val Gin 
170 175 180 185 

caa etc gag tea tgt agg ate aaa ctg acc caa eta gaa caa gag att 928 
Gin Leu Glu Ser Cys Arg lie Lys Leu Thr Gin Leu Glu Gin Glu lie 
190 195 200 

caa egg gee aga tee caa gge gta ttc ttt gga ggg tet ctt ata gga 976 
Gin Arg Ala Arg Ser Gin Gly Val Phe Phe Gly Gly Ser Leu lie Gly 
205 210 215 

gga gat caa cag caa ggt gga eta ecc att gge cct gge aac ate age 1024 
Gly Asp Gin Gin Gin Gly Gly Leu Pro lie Gly Pro Gly Asn lie Ser 
220 225 230 

tet gaa gca gcg gtg ttc gat atg gaa tat gcg agg tgg ctg gag gag 10 72 

Ser Glu Ala Ala Val Phe Asp Met Glu Tyr Ala Arg Trp Leu Glu Glu 

235 240 245 

cag cag agg eta tta aac gaa eta agg gtg gca aca caa gaa cac ttg 112 0 

Gin Gin Arg Leu Leu Asn Glu Leu Arg Val Ala Thr Gin Glu His Leu 
250 255 260 265 

tec gag aac gag ctt agg atg ttt gtg gac aca tgt tta get eat tat 1168 
Ser Glu Asn Glu Leu Arg Met Phe Val Asp Thr Cys Leu Ala His Tyr 
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270 275 280 

gac cat ttg att aac etc aag get atg gtc get aag ace gat gtc ttc 1216 
Asp His Leu lie Asn Leu Lys Ala Met Val Ala Lys Thr Asp Val Phe 
285 290 295 

cac etc att tct gga gca tgg aaa act cca get gaa cgt tgc ttc ttg 1264 
His Leu lie Ser Gly Ala Trp Lys Thr Pro Ala Glu Arg Cys Phe Leu 
300 305 310 

tgg atg ggt ggt ttc cgt cca teg gag ate att aag gtg att gtg aac 1312 
Trp Met Gly Gly Phe Arg Pro Ser Glu lie lie Lys Val lie Val Asn 
315 320 325 

cag ata gaa cca ttg acg gag caa cag ata gtt ggg ata tgt ggg etg 1360 
Gin lie Glu Pro Leu Thr Glu Gin Gin lie Val Gly lie Cys Gly Leu 

330 335 340 345 

caa cag tee aca caa gag gee gag gag get etc teg caa gge etc gag 1408 
Gin Gin Ser Thr Gin Glu Ala Glu Glu Ala Leu Ser Gin Gly Leu Glu 
350 355 360 

geg ttg aat caa tea ett tec gat age att gtc tct gac tec etc ccg 1456 
Ala Leu Asn Gin Ser Leu Ser Asp Ser lie Val Ser Asp Ser Leu Pro 
365 370 375 

cct gcc tec gca cca ett cct cct cat eta tec aat ttc atg tea cac 1504 
Pro Ala Ser Ala Pro Leu Pro Pro His Leu Ser Asn Phe Met Ser His 
380 385 390 

atg tec tta get etc aac aag etc tct get etc gag ggc tte gtt etc 1552 
Met Ser Leu Ala Leu Asn Lys Leu Ser Ala Leu Glu Gly Phe Val Leu 
395 400 405 

cag geg gat aat ttg agg cac caa acg ate eat agg etg aac caa ttg 1600 
Gin Ala Asp Asn Leu Arg His Gin Thr lie His Arg Leu Asn Gin Leu 
410 415 420 425 

ttg acg ace cgt caa gaa gca egg tgt ett eta gcc gtt geg gag tac 1648 
Leu Thr Thr Arg Gin Glu Ala Arg Cys Leu Leu Ala Val Ala Glu Tyr 
430 435 440 

ttc cac cgt ett caa get eta agt tct etc tgg eta gee cgt cct egg 1696 
Phe His Arg Leu Gin Ala Leu Ser Ser Leu Trp Leu Ala Arg Pro Arg 
445 450 455 

caa gat gga taa tactaaaaca actgatgaag gaaaccaaaa acaaaaacaa 174 8 
Gin Asp Gly 
460 

gagaataggt tgattagtta gccgccagct tgacctcttt atcatatata tcgtctctct 1808 

actcaaatac agtgcaatta gggaaaattg tttggcttet ttttggtata tgattcttae 1868 

tattatgttt ttaatcaaga 1888 

<210> 18 
<211> 460 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Met Gin Gly His His Gin Asn His His Gin His Leu Ser Ser Ser Ser 
15 10 15 

Ala Thr Ser Ser His Gly Asn Phe Met Asn Lys Asp Gly Tyr Asp lie 
20 25 30 

Gly Glu lie Asp Pro Ser Leu Phe Leu Tyr Leu Asp Gly Gin Gly His 
35 40 45 
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His Asp Pro Pro Ser Thr Ala Pro Ser Pro Leu His His His His Thr 
50 55 60 

Thr Gin Asn Leu Ala Met Arg Pro Pro Thr Ser Thr Leu Asn lie Phe 
65 70 75 80 

Pro Ser Gin Pro Met His lie Glu Pro Pro Pro Ser Ser Thr His Asn 
85 90 95 

Thr Asp Asn Thr Arg Leu Val Pro Ala Ala Gin Pro Ser Gly Ser Thr 
100 105 110 

Arg Pro Ala Ser Asp Pro Ser Met Asp Leu Thr Asn His Ser Gin Phe 
115 120 125 

His Gin Pro Pro Gin Gly Ser Lys Ser lie Lys Lys Glu Gly Asn Arg 
130 135 140 

Lys Gly Leu Ala Ser Ser Asp His Asp lie Pro Lys Ser Ser Asp Pro 
145 150 155 160 

Lys Thr Leu Arg Arg Leu Ala Gin Asn Arg Glu Ala Ala Arg Lys Ser 
165 170 175 

Arg Leu Arg Lys Lys Ala Tyr Val Gin Gin Leu Glu Ser Cys Arg lie 
180 185 190 

Lys Leu Thr Gin Leu Glu Gin Glu lie Gin Arg Ala Arg Ser Gin Gly 
195 200 205 

Val Phe Phe Gly Gly Ser Leu lie Gly Gly Asp Gin Gin Gin Gly Gly 
210 215 220 

Leu Pro lie Gly Pro Gly Asn lie Ser Ser Glu Ala Ala Val Phe Asp 
225 230 235 240 

Met Glu Tyr Ala Arg Trp Leu Glu Glu Gin Gin Arg Leu Leu Asn Glu 
245 250 255 

Leu Arg Val Ala Thr Gin Glu His Leu Ser Glu Asn Glu Leu Arg Met 
260 265 270 

Phe Val Asp Thr Cys Leu Ala His Tyr Asp His Leu He Asn Leu Lys 
275 280 285 

Ala Met Val Ala Lys Thr Asp Val Phe His Leu He Ser Gly Ala Trp 
290 295 300 

Lys Thr Pro Ala Glu Arg Cys Phe Leu Trp Met Gly Gly Phe Arg Pro 
305 310 315 320 

Ser Glu He lie Lys Val lie Val Asn Gin lie Glu Pro Leu Thr Glu 
325 330 335 

Gin Gin He Val- Gly He Cys Gly Leu Gin Gin Ser Thr Gin Glu Ala 
340 345 350 
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Glu Glu Ala Leu Ser Gin Gly Leu Glu Ala Leu Asn Gin Ser Leu Ser 
355 360 365 

Asp Ser lie Val Ser Asp Ser Leu Pro Pro Ala Ser Ala Pro Leu Pro 
370 375 380 

Pro His Leu Ser Asn Phe Met Ser His Met Ser Leu Ala Leu Asn Lys 
385 390 395 400 

Leu Ser Ala Leu Glu Gly Phe Val Leu Gin Ala Asp Asn Leu Arg His 
405 410 415 

Gin Thr lie His Arg Leu Asn Gin Leu Leu Thr Thr Arg Gin Glu Ala 
420 425 430 

Arg Cys Leu Leu Ala Val Ala Glu Tyr Phe His Arg Leu Gin Ala Leu 
435 440 445 

Ser Ser Leu Trp Leu Ala Arg Pro Arg Gin Asp Gly 
450 455 460 



<210> 


19 


<211> 


1707 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(98) . . (1444) 


<223> 


G748 


<400> 


19 



ccacgcgtcc gcactctccc aaatctctct tctttaacaa caaaaaaaaa atcacagaga 60 

catagagaga agaagacgga acagaggctc caaaaaa atg atg atg gag act aga 115 

Met Met Met Glu Thr Arg 

1 5 

gat cca get att aag ctt ttc ggt atg aaa ate cct ttt ccg teg gtt 163 
Asp Pro Ala lie Lys Leu Phe Gly Met Lys lie Pro Phe Pro Ser Val 
10 15 20 

ttt gaa teg gea gtt acg gtg gag gat gac gaa gaa gat gac tgg age 211 
Phe Glu Ser Ala Val Thr Val Glu Asp Asp Glu Glu Asp Asp Trp Ser 
25 30 35 

99c gga gat gac aaa tea cca gag aag gta act cca gag tta tea gat 259 
Gly Gly Asp Asp Lys Ser Pro Glu Lys Val Thr Pro Glu Leu Ser Asp 
40 45 50 

aag aac aac aac aae tgt aac gac aac agt ttt aac aat teg aaa ccc 3 07 

Lys Asn Asn Asn Asn Cys Asn Asp Asn Ser Phe Asn Asn Ser Lys Pro 
55 60 65 70 

gaa acc ttg gac aaa gag gaa gcg aca tea act gat cag ata gag agt 355 
Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser Thr Asp Gin He Glu Ser 
75 80 85 

agt gac acg cct gag gat aat cag cag acg aca cct gat ggt aaa acc 403 
Ser Asp Thr Pro Glu Asp Asn Gin Gin Thr Thr Pro Asp Gly Lys Thr 
90 95 100 

eta aag aaa ccg act aag att eta ccg tgt eeg aga tgc aaa age atg 451 
Leu Lys Lys Pro Thr Lys He Leu Pro Cys Pro Arg Cys Lys Ser Met 
105 110 115 
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gag acc aag ttc tgt tat tac aac aac tac aac ata aac cag cct cgt 
Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr Asn lie Asn Gin Pro Arg 
120 125 130 

cat ttc tgc aag get tgt cag aga tat tgg act get gga ggg act atg 
His Phe Cys Lys Ala Cys Gin Arg Tyr Trp Thr Ala Gly Gly Thr Met 
135 140 145 150 

agg aat gtt cct gtg ggg gca gga cgt cgt aag aac aaa age tea tct 
Arg Asn Val Pro Val Gly Ala Gly Arg Arg Lys Asn Lys Ser Ser Ser 
155 160 165 

tct cat tac cgt cac ate act att tec gag get ett gag get gcg agg 
Ser His Tyr Arg His lie Thr lie Ser Glu Ala Leu Glu Ala Ala Arg 
170 175 180 

ett gae ccg gge tta eag gea aac aea agg gte ttg agt ttt ggt etc 
Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg Val Leu Ser Phe Gly Leu 
185 190 195 

gaa get cag eag eag eae gtt get get ccc atg aea cct gtt atg aag 
Glu Ala Gin Gin Gin His Val Ala Ala Pro Met Thr Pro Val Met Lys 
200 205 210 

eta caa gaa gat caa aag gtc tea aac ggt get agg aac agg ttt cac 
Leu Gin Glu Asp Gin Lys Val Ser Asn Gly Ala Arg Asn Arg Phe His 
215 220 225 230 

ggg tta gcg gat caa egg ett gta get egg gta gag aat gga gat gat 
Gly Leu Ala Asp Gin Arg Leu Val Ala Arg Val Glu Asn Gly Asp Asp 

235 240 245 

tgc tea age gga tec tct gtg ace acc tct aac aat cac tea gtg gat 
Cys Ser Ser Gly Ser Ser Val Thr Thr Ser Asn Asn His Ser Val Asp 
250 255 260 

gaa tea aga gca caa age gge agt gtt gtt gaa gea caa atg aac aac 
Glu Ser Arg Ala Gin Ser Gly Ser Val Val Glu Ala Gin Met Asn Asn 
265 270 275 

aac aac aac aat aac atg aat ggt tat get tgc ate cca ggt gtt cca 
Asn Asn Asn Asn Asn Met Asn Gly Tyr Ala Cys lie Pro Gly Val Pro 
280 285 290 

tgg cct tac acg tgg aat cca gcg atg cct cca cca ggt ttt tac ccg 
Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro Pro Pro Gly Phe Tyr Pro 
295 300 305 310 

cct cca ggg tat cca atg ccg ttt tac cct tac tgg acc ate cca atg 
Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro Tyr Trp Thr lie Pro Met 
315 320 325 

eta cca ccg cat caa tec tea teg cct ata age caa aag tgt tea aat 
Leu Pro Pro His Gin Ser Ser Ser Pro lie Ser Gin Lys Cys Ser Asn 
330 335 340 

aea aac tct ccg act etc gga aag cat ccg aga gat gaa gga tea teg 
Thr Asn Ser Pro Thr Leu Gly Lys His Pro Arg Asp Glu Gly Ser Ser 
345 350 355 

aaa aag gac aat gag aca gag cga aaa cag aag gee ggg tgc gtt ctg 
Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin Lys Ala Gly Cys Val Leu 
360 365 370 

gtc ccg aaa acg ttg aga ata gat gat cct aac gaa gea gca aag age 
Val Pro Lys Thr Leu Arg lie Asp Asp Pro Asn Glu Ala Ala Lys Ser 

375 380 385 390 

teg ata tgg aca aca ttg gga ate aag aac gag gcg atg tgc aaa gee 
Ser lie Trp Thr Thr Leu Gly lie Lys Asn Glu Ala Met Cys Lys Ala 
395 400 405 

ggt ggt atg ttc aaa ggg ttt gat cat aag aca aag atg tat aac aac 
Gly Gly Met Phe Lys Gly Phe Asp His Lys Thr Lys Met Tyr Asn Asn 
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499 

547 

595 

643 

691 

739 

787 

835 

883 

931 

979 
1027 
1075 
1123 
1171 
1219 
1267 
1315 
1363 
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410 415 420 

gac aaa get gag aac tec cct gtt ctt tct get aac cct get get eta 1411 
Asp Lys Ala Glu Asn Ser Pro Val Leu Ser Ala Asn Pro Ala Ala Leu 
425 430 435 

tea aga tea cac aat ttc cat gaa cag att tag agttacatat gtatatgtat 1464 
Ser Arg Ser His Asn Phe His Glu Gin lie 



440 




445 










atatgtatga 


ttgattgtat 


gtatagatga 


tactggagaa 


tgatgagttt 


ttgagaatca 


1524 


aaetcttttc 


ttctttetag 


tgattgectt 


tattccttta 


eatgttttgg 


ttctetgtae 


1584 


aetatttgat 


ttaccttttt 


taetttettt 


cttcatttgt 


eaggaaatgt 


tggaagataa 


1644 


cattaatggt 


aaaaagttgg 


tgtggaccgt 


tgttgegttg 


geatttcaaa 


aaaaaaaaaa 


1704 


aaa 












1707 



<210> 20 
<211> 448 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 20 

Met Met Met Glu Thr Arg Asp Pro Ala lie Lys Leu Phe Gly Met Lys 
15 10 15 

lie Pro Phe Pro Ser Val Phe Glu Ser Ala Val Thr Val Glu Asp Asp 
20 25 30 

Glu Glu Asp Asp Trp Ser Gly Gly Asp Asp Lys Ser Pro Glu Lys Val 
35 40 45 

Thr Pro Glu Leu Ser Asp Lys Asn Asn Asn Asn Cys Asn Asp Asn Ser 
50 55 60 

Phe Asn Asn Ser Lys Pro Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser 
65 70 75 80 

Thr Asp Gin lie Glu Ser Ser Asp Thr Pro Glu Asp Asn Gin Gin Thr 
85 90 95 

Thr Pro Asp Gly Lys Thr Leu Lys Lys Pro Thr Lys lie Leu Pro Cys 
100 105 110 

Pro Arg Cys Lys Ser Met Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr 
115 120 125 

Asn lie Asn Gin Pro Arg His Phe Cys Lys Ala Cys Gin Arg Tyr Trp 
130 135 140 

Thr Ala Gly Gly Thr Met Arg Asn Val Pro Val Gly Ala Gly Arg Arg 
145 150 155 160 

Lys Asn Lys Ser Ser Ser Ser His Tyr Arg His lie Thr lie Ser Glu 
165 170 175 

Ala Leu Glu Ala Ala Arg Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg 
180 185 190 
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Val Leu Ser Phe Gly Leu Glu Ala Gin Gin Gin HisVal Ala Ala Pro 
195 200 205 

Met Thr Pro Val Met Lys Leu Gin Glu Asp Gin Lys Val Ser Asn Gly 
210 215 220 

Ala Arg Asn Arg Phe His Gly Leu Ala Asp Gin Arg Leu Val Ala Arg 
225 230 235 240 

Val Glu Asn Gly Asp Asp Cys Ser Ser Gly Ser Ser Val Thr Thr Ser 
245 250 255 

Asn Asn His Ser Val Asp Glu Ser Arg Ala Gin Ser Gly Ser Val Val 
260 265 270 

Glu Ala Gin Met Asn Asn Asn Asn Asn Asn Asn Met Asn Gly Tyr Ala 
275 280 285 

Cys lie Pro Gly Val Pro Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro 
290 295 300 

Pro Pro Gly Phe Tyr Pro Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro 
305 310 315 320 

Tyr Trp Thr lie Pro Met Leu Pro Pro His Gin Ser Ser Ser Pro lie 
325 330 335 

Ser Gin Lys Cys Ser Asn Thr Asn Ser Pro Thr Leu Gly Lys His Pro 
340 345 350 

Arg Asp Glu Gly Ser Ser Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin 
355 360 365 

Lys Ala Gly Cys Val Leu Val Pro Lys Thr Leu Arg lie Asp Asp Pro 
370 375 380 

Asn Glu Ala Ala Lys Ser Ser lie Trp Thr Thr Leu Gly lie Lys Asn 
385 390 395 400 

Glu Ala Met Cys Lys Ala Gly Gly Met Phe Lys Gly Phe Asp His Lys 
405 410 415 

Thr Lys Met Tyr Asn Asn Asp Lys Ala Glu Asn Ser Pro Val Leu Ser 
420 425 430 

Ala Asn Pro Ala Ala Leu Ser Arg Ser His Asn Phe His Glu Gin lie 

440 445 





435 


<210> 


21 


<211> 


1149 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (1149) 


<223> 


G431 



Page 34 



wo 01/36444 PCT/USOO/31325 

MBI0018 Sequence Listing. ST25 

<400> 21 

atg gag agt ggt tec aac age act tct tgt cca atg get ttt gee ggg 48 
Met Glu Ser Gly Ser Asn Ser Thr Ser Cys Pro Met Ala Phe Ala Gly 
15 10 15 

gat aat agt gat ggt cog atg tgt cct atg atg atg atg atg cog ccc 96 
Asp Asn Ser Asp Gly Pro Met Cys Pro Met Met Met Met Met Pro Pro 
20 25 30 

ate atg aca tea cat caa cat eat ggt cat gat cat caa cat caa caa 144 
lie Met Thr Ser His Gin His His Gly His Asp His Gin His Gin Gin 

35 40 45 

caa gaa eat gat ggt tat gea tat eag tea cac eae caa caa agt agt 192 
Gin Glu His Asp Gly Tyr Ala Tyr Gin Ser His His Gin Gin Ser Ser 
50 55 60 

tec ctt ttt ett caa tea eta get cct ccc caa gga act aag aac aaa 240 
Ser Leu Phe Leu Gin Ser Leu Ala Pro Pro Gin Gly Thr Lys Asn Lys 
65 70 75 80 

gtt get tct tct tct tct cct tec tct tgt get cct gee tat tct eta 288 
Val Ala Ser Ser Ser Ser Pro Ser Ser Cys Ala Pro Ala Tyr Ser Leu 
85 90 95 

atg gag ate eat cat aac gaa ate gtt gea gga gga ate aac cct tgc 336 
Met Glu lie His His Asn Glu lie Val Ala Gly Gly lie Asn Pro Cys 
100 105 110 

tee tct tte tct tct tea gee tct gte aag gee aag ate atg get cat 384 
Ser Ser Phe Ser Ser Ser Ala Ser Val Lys Ala Lys lie Met Ala His 
115 120 125 

cct cac tac cac cge etc ttg gee get tat gtc aat tgt cag aag gtt 432 
Pro His Tyr His Arg Leu Leu Ala Ala Tyr Val Asn Cys Gin Lys Val 
130 135 140 

gga gea cca ccg gag gtt gtg gcg agg ctg gag gag gea tgc teg tct 480 
Gly Ala Pro Pro Glu Val Val Ala Arg Leu Glu Glu Ala Cys Ser Ser 
145 150 155 160 

gee gea gcc gea gee gea tct atg ggg cca aca ggg tgt ctt ggt gaa 528 
Ala Ala Ala Ala Ala Ala Ser Met Gly Pro Thr Gly Cys Leu Gly Glu 

165 170 175 

gat cca ggg ctt gat caa ttc atg gaa get tac tgt gaa atg etc gtt 576 
Asp Pro Gly Leu Asp Gin Phe Met Glu Ala Tyr Cys Glu Met Leu Val 
180 185 190 

aag tat gag caa gag etc tec aaa cct ttc aag gaa get atg gtc ttc 624 
Lys Tyr Glu Gin Glu Leu Ser Lys Pro Phe Lys Glu Ala Met Val Phe 
195 200 205 

ctt caa cgt gtc gag tgt caa ttc aaa tec etc tct eta tec tea cct 672 
Leu Gin Arg Val Glu Cys Gin Phe Lys Ser Leu Ser Leu Ser Ser Pro 
210 215 220 

tec tct ttc tec ggt tat gga gag aca gea att gat agg aac aat aat 720 
Ser Ser Phe Ser Gly Tyr Gly Glu Thr Ala lie Asp Arg Asn Asn Asn 
225 230 235 240 

ggg tea tec gag gaa gaa gtc gat atg aac aat gaa ttt gta gat cca 768 
Gly Ser Ser Glu Glu Glu Val Asp Met Asn Asn Glu Phe Val Asp Pro 
245 250 255 

caa get gag gat aga gag ctt aaa gga eag etc ttg cgc aag tac agt 816 
Gin Ala Glu Asp Arg Glu Leu Lys Gly Gin Leu Leu Arg Lys Tyr Ser 
260 265 270 

ggt tac tta ggg age etc aag caa gag ttc atg aag aag agg aag aaa 864 
Gly Tyr Leu Gly Ser Leu Lys Gin Glu Phe Met Lys Lys Arg Lys Lys 
275 280 285 

gga aag etc cct aaa gaa get cgt caa caa ctg ett gat tgg tgg age 912 
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Gly Lys Leu Pro Lys Glu Ala Arg Gin Gin Leu Leu Asp Trp Trp Ser 
290 295 300 

cgt cac tac aaa tgg cct tac cct teg gag caa caa aag etc gcc ctt 960 
Arg His Tyr Lys Trp Pro Tyr Pro Ser Glu Gin Gin Lys Leu Ala Leu 
305 310 ^ 315 320 

gcg gaa tea acg ggg ctg gac cag aaa cag ata aac aat tgg ttc ata 1008 
Ala Glu Ser Thr Gly Leu Asp Gin Lys Gin lie Asn Asn Trp Phe He 
325 330 335 

aac cag agg aaa egg cat tgg aag ecg teg gag gac atg cag ttt gta 1056 
Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asp Met Gin Phe Val 
340 345 350 

gta atg gac gca aca cat cct cac cat tac ttc atg gat aat gtc ttg 1104 
Val Met Asp Ala Thr His Pro His His Tyr Phe Met Asp Asn Val Leu 
355 360 365 

gac aat cct ttc cca atg gat cac ate tec tec ace atg ctt tga 1149 
Asp Asn Pro Phe Pro Met Asp His He Ser Ser Thr Met Leu 
370 375 . 380 

<210> 22 
<211> 382 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 22 

Met Glu Ser Gly Ser Asn Ser Thr Ser Cys Pro Met Ala Phe Ala Gly 
15 10 15 

Asp Asn Ser Asp Gly Pro Met Cys Pro Met Met Met Met Met Pro Pro 
20 25 30 

He Met Thr Ser His Gin His His Gly His Asp His Gin His Gin Gin 
35 40 45 

Gin Glu His Asp Gly Tyr Ala Tyr Gin Ser His His Gin Gin Ser Ser 
50 55 60 

Ser Leu Phe Leu Gin Ser Leu Ala Pro Pro Gin Gly Thr Lys Asn Lys 
65 70 75 80 

Val Ala Ser Ser Ser Ser Pro Ser Ser Cys Ala Pro Ala Tyr Ser Leu 
85 90 95 

Met Glu He His His Asn Glu He Val Ala Gly Gly He Asn Pro Cys 
100 105 110 

Ser Ser Phe Ser Ser Ser Ala Ser Val Lys Ala Lys He Met Ala His 
115 120 125 

Pro His Tyr His Arg Leu Leu Ala Ala Tyr Val Asn Cys Gin Lys Val 
130 135 140 

Gly Ala Pro Pro Glu Val Val Ala Arg Leu Glu Glu Ala Cys Ser Ser 
145' 150 155 160 

Ala Ala Ala Ala Ala Ala Ser Met Gly Pro Thr Gly Cys Leu Gly Glu 
165 170 175 
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Asp Pro Gly Leu Asp Gin Phe Met Glu Ala Tyr Cys Glu Met Leu Val 
180 185 190 

Lys Tyr Glu Gin Glu Leu Ser Lys Pro Phe Lys Glu Ala Met Val Phe 
195 200 205 

Leu Gin Arg Val Glu Cys Gin Phe Lys Ser Leu Ser Leu Ser Ser Pro 
210 215 220 

Ser Ser Phe Ser Gly Tyr Gly Glu Thr Ala lie Asp Arg Asn Asn Asn 
225 230 235 240 

Gly Ser Ser Glu Glu Glu Val Asp Met Asn Asn Glu Phe Val Asp Pro 
245 250 255 

Gin Ala Glu Asp Arg Glu Leu Lys Gly Gin Leu Leu Arg Lys Tyr Ser 
260 265 270 

Gly Tyr Leu Gly Ser Leu Lys Gin Glu Phe Met Lys Lys Arg Lys Lys 
275 280 285 

Gly Lys Leu Pro Lys Glu Ala Arg Gin Gin Leu Leu Asp Trp Trp Ser 
290 295 300 

Arg His Tyr Lys Trp Pro Tyr Pro Ser Glu Gin Gin Lys Leu Ala Leu 
305 310 315 320 

Ala Glu Ser Thr Gly Leu Asp Gin Lys Gin lie Asn Asn Trp Phe lie 
325 330 335 

Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asp Met Gin Phe Val 
340 345 350 

Val Met Asp Ala Thr His Pro His His Tyr Phe Met Asp Asn Val Leu 
355 360 365 

Asp Asn Pro Phe Pro Met Asp His lie Ser Ser Thr Met Leu 
370 375 380 



<210> 


23 


<211> 


1136 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(118) . . (1074) 


<223> 


G187 


<400> 


23 



tagacctctt aggaaaaaaa cctaaaaacc taatccccaa acctaaaagg cttatctcat 60 

ctcttcttct ttgtcttctt tactcttttt ttacctctct cttcattgtt cttcacc 117 

atg tct aat gaa acc aga gat etc tac aac tac caa tac act tea teg 165 
Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gin Tyr Pro Ser Ser 
15 10 15 

ttt teg ttg eac gaa atg atg aat ctg cct act tea aat cca tct tct 213 
Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser Asn Pro Ser Ser 
20 25 30 
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tat gga aac etc cca tea caa aac ggt ttt aat cca tct act tat tec 261 
Tyr Gly Asn Leu Pro Ser Gin Asn Gly Phe Asn Pro Ser Thr Tyr Ser 
35 40 45 



ttc ace gat tgt etc caa agt tct cca gea gcg tat gaa tct eta ett 
Phe Thr Asp Cys Leu Gin Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 
50 55 60 



ttg agg gag att ttt cct tea att ttc ttt aag caa gag cet tga 
Leu Arg Glu lie Phe Pro Ser lie Phe Phe Lys Gin Glu Pro 
305 310 315 



309 



cag aaa act ttt ggt ctt tct ccc tct tec tea gag gtt ttc aat tct 357 
Gin Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 
65 70 75 80 

teg ate gat caa gaa ecg aac egt gat gtt act aat gae gta ate aat 405 
Ser lie Asp Gin Glu Pro Asn Arg Asp Val Thr Asn Asp Val lie Asn 
85 90 95 

ggt ggt gca tgc aac gag act gaa act agg gtt tct cct tct aat tct 453 
Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 
100 105 110 

tee tct agt gag get gat eac ccc ggt gaa gat tec ggt aag age egg 501 
Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 
115 120 125 

agg aaa cga gag tta gte ggt gaa gaa gat caa att tec aaa aaa gtt 54 9 

Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gin lie Ser Lys Lys Val 
130 135 140 

ggg aaa acg aaa aag act gag gtg aag aaa caa aga gag cca cga gte 597 
Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gin Arg Glu Pro Arg Val 
145 150 155 160 

teg ttt atg act aaa agt gaa gtt gat eat ett gaa gat ggt tat aga 645 
Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 
165 170 175 

tgg aga aaa tac ggc caa aag get gta aaa aat age cet tat cca agg 693 
Trp Arg Lys Tyr Gly Gin Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 
180 185 190 

agt tac tat aga tgt aca aca caa aag tgc aac gtg aag aaa cga gtg 741 
Ser Tyr Tyr Arg Cys Thr Thr Gin Lys Cys Asn Val Lys Lys Arg Val 
195 200 205 

gag aga teg ttc caa gat cca acg gtt gtg att aca act tac gag ggt 789 
Glu Arg Ser Phe Gin Asp Pro Thr Val Val lie Thr Thr Tyr Glu Gly 
210 215 220 

caa eac aac cac ccg att ccg act aat ctt cga gga agt tct gee gcg 837 
Gin His Asn His Pro lie Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala 
225 230 235 240 

get get atg ttc tec gca gae etc atg act cca aga age ttt gca cat 885 
Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 
245 250 255 

gat atg ttt agg acg gca get tat act aac ggc ggt tct gtg gcg gcg 933 
Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 
260 265 270 

get ttg gat tat gga tat gga caa agt ggt tat ggt agt gtg aat tea 981 
Ala Leu Asp Tyr Gly Tyr Gly Gin Ser Gly Tyr Gly Ser Val Asn Ser 

275 280 285 

aac cct agt tct eac caa gtg tat eat caa ggg ggt gag tat gag etc 1029 
Asn Pro Ser Ser His Gin Val Tyr His Gin Gly Gly Glu Tyr Glu Leu 

290 295 300 



1074 



tegateattg ttataactac atatattata tatattgaga gagagaggta gagaaaaaaa 1134 
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<210> 24 
<211> 318 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 24 

Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gin Tyr Pro Ser Ser 
15 10 15 

Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser Asn Pro Ser Ser 
20 25 30 

Tyr Gly Asn Leu Pro Ser Gin Asn Gly Phe Asn Pro Ser Thr Tyr Ser 
35 40 45 

Phe Thr Asp Cys Leu Gin Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 
50 55 60 

Gin Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 
65 70 75 80 

Ser lie Asp Gin Glu Pro Asn Arg Asp Val Thr Asn Asp Val lie Asn 
85 90 95 

Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 
100 105 110 

Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 
115 120 125 

Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gin lie Ser Lys Lys Val 
130 135 140 

Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gin Arg Glu Pro Arg Val 
145 150 155 160 

Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 
165 170 175 

Trp Arg Lys Tyr Gly Gin Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 
180 185 190 

Ser Tyr Tyr Arg Cys Thr Thr Gin Lys Cys Asn Val Lys Lys Arg Val 
195 200 205 

Glu Arg Ser Phe Gin Asp Pro Thr Val Val lie Thr Thr Tyr Glu Gly 
210 215 220 

Gin His Asn His Pro lie Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala 
225 230 235 240 

Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 
245 250 255 



1136 



Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 
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260 265 270 

Ala Leu Asp Tyr Gly Tyr Gly Gin Ser Gly Tyr Gly Ser Val Asn Ser 
275 280 285 

Asn Pro Ser Ser His Gin Val Tyr His Gin Gly Gly Glu Tyr Glu Leu 
290 295 300 

Leu Arg Glu lie Phe Pro Ser lie Phe Phe Lys Gin Glu Pro 

310 315 



305 




<210> 


25 


<211> 


2580 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2580) 


<223> 


G470 


<400> 


25 



atg gcg agt teg gag gtt tea atg aaa ggt aat cgt gga gga gat aac 
Met Ala Ser Ser Glu Val Ser Met Lys Gly Asn Arg Gly Gly Asp Asn 
15 10 15 



etc tgt cga gtt att aat gta gat tta aag gca gag gca gat aca gat 
Leu Cys Arg Val lie Asn Val Asp Leu Lys Ala Glu Ala Asp Thr Asp 
115 120 125 



48 



ttc tec tec tct ggt ttt agt gae cct aag gag act aga aat gte tec 96 
Phe Ser Ser Ser Gly Phe Ser Asp Pro Lys Glu Thr Arg Asn Val Ser 
20 25 30 

gte gee ggc gag ggg caa aaa agt aat tct ace cga tec get gcg get 144 
Val Ala Gly Glu Gly Gin Lys Ser Asn Ser Thr Arg Ser Ala Ala Ala 

35 40 45 

gag cgt get ttg gae cct gag get get ett tac aga gag eta tgg cac 192 
Glu Arg Ala Leu Asp Pro Glu Ala Ala Leu Tyr Arg Glu Leu Trp His 
50 55 60 

get tgt get ggt ecg ett gtg aeg gtt cct aga caa gae gae cga gte 24 0 

Ala Cys Ala Gly Pro Leu Val Thr Val Pro Arg Gin Asp Asp Arg Val 
65 70 75 80 

ttc tat ttt cct caa gga cac ate gag eag gtg gag get teg aeg aac 288 
Phe Tyr Phe Pro Gin Gly His lie Glu Gin val Glu Ala Ser Thr Asn 
85 90 95 

eag gcg gca gaa caa eag atg cet etc tat gat ett ccg tea aag ett 336 
Gin Ala Ala Glu Gin Gin Met Pro Leu Tyr Asp Leu Pro Ser Lys Leu 
100 105 110 



384 



gaa gtt tat gcg eag att act ett ett cct gag get aat caa gae gag 432 
Glu Val Tyr Ala Gin lie Thr Leu Leu Pro Glu Ala Asn Gin Asp Glu 
130 135 140 

aat gca att gag aaa gaa gcg cct ett cct cca cct ccg agg ttc eag 480 
Asn Ala lie Glu Lys Glu Ala Pro Leu Pro Pro Pro Pro Arg Phe Gin 

145 150 155 160 

gtg cat teg ttc tgc aaa ace ttg act gca tec gae aca agt aca eat 528 
Val His Ser Phe Cys Lys Thr Leu Thr Ala Ser Asp Thr Ser Thr His 
165 170 175 

ggt gga ttt tct gtt ett agg cga cat gcg gat gaa tgt etc eea cct 576 
Gly Gly Phe Ser Val Leu Arg Arg His Ala Asp Glu Cys Leu Pro Pro 
180 185 190 
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ctg gat atg tct cga cag cct ccc act caa gag tta gtt gca aag gat 
Leu Asp Met Ser Arg Gin Pro Pro Thr Gin Glu Leu Val Ala Lys Asp 
195 200 205 

ttg cat gca aat gag tgg cga ttc aga cat ata ttc egg ggt caa cca 
Leu His Ala Asn Glu Trp Arg Phe Arg His lie Phe Arg Gly Gin Pro 
210 215 220 

egg agg cat ttg eta cag agt ggg tgg agt gtg ttt gtt age tec aaa 
Arg Arg His Leu Leu Gin Ser Gly Trp Ser Val Phe Val Ser Ser Lys 
225 230 235 240 

agg eta gtt gca gge gat gcg ttt ata ttt eta agg gge gag aat gga 
Arg Leu Val Ala Gly Asp Ala Phe lie Phe Leu Arg Gly Glu Asn Gly 

245 250 255 

gaa tta aga gtt ggt gta agg egt geg atg cga caa caa gga aac gtg 
Glu Leu Arg Val Gly Val Arg Arg Ala Met Arg Gin Gin Gly Asn Val 
260 265 270 

ccg tct tct gtt ata tct age cat age atg cat ctt gga gta ctg gee 
Pro Ser Ser Val lie Ser Ser His Ser Met His Leu Gly Val Leu Ala 

275 280 285 

ace gca tgg cat gcc att tea aca ggg act atg ttt aea gte tac tac 
Thr Ala Trp His Ala lie Ser Thr Gly Thr Met Phe Thr Val Tyr Tyr 
290 295 300 

aaa cce agg aeg age eca tet gag ttt att gtt ceg tte gat cag tat 
Lys Pro Arg Thr Ser Pro Ser Glu Phe lie Val Pro Phe Asp Gin Tyr 
305 310 315 320 

atg gag tct gtt aag aat aac tac tct att gge atg aga ttc aaa atg 
Met Glu Ser Val Lys Asn Asn Tyr Ser He Gly Met Arg Phe Lys Met 
325 330 335 

aga ttt gaa gge gaa' gag get cct gag cag agg ttt act gge aca ate 
Arg Phe Glu Gly Glu Glu Ala Pro Glu Gin Arg Phe Thr Gly Thr He 
340 345 350 

gtt ggg att gaa gag tet gat cct act agg tgg cca aaa tea aag tgg 
Val Gly He Glu Glu Ser Asp Pro Thr Arg Trp Pro Lys Ser Lys Trp 
355 360 365 

aga tec etc aag gtg aga tgg gat gag act tet agt att eet cga cct 
Arg Ser Leu Lys Val Arg Trp Asp Glu Thr Ser Ser lie Pro Arg Pro 
370 375 380 

gat aga gta tct ceg tgg aaa gta gag cca get ctt get cct cct get 
Asp Arg Val Ser Pro Trp Lys Val Glu Pro Ala Leu Ala Pro Pro Ala 

385 390 395 400 

ttg agt eet gtt cca atg cct agg cct aag agg ccc aga tea aat ata 
Leu Ser Pro Val Pro Met Pro Arg Pro Lys Arg Pro Arg Ser Asn He 
405 410 415 

gca eet tea tet cct gae tct teg atg ctt ace aga gaa ggt aea act 
Ala Pro Ser Ser Pro Asp Ser Ser Met Leu Thr Arg Glu Gly Thr Thr 
420 425 430 

aag gca aac atg gae cct tta cca gca age gga ctt tea agg gtc ttg 
Lys Ala Asn Met Asp Pro Leu Pro Ala Ser Gly Leu Ser Arg Val Leu 
435 440 445 

caa ggt caa gaa tac teg acc ttg agg aeg aaa cat act gag agt gta 
Gin Gly Gin Glu Tyr Ser Thr Leu Arg Thr Lys His Thr Glu Ser Val 
450 455 460 

gag tgt gat get cct gag aat tct gtt gtc tgg caa tct tea gcg gat 
Glu Cys Asp Ala Pro Glu Asn Ser Val Val Trp Gin Ser Ser Ala Asp 
465 470 475 480 

gat gat aag gtt gae gtg gtt teg ggt tct aga aga tat gga tet gag 
Asp Asp Lys Val Asp Val Val Ser Gly Ser Arg Arg Tyr Gly Ser Glu 
485 490 495 
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aac tgg atg tec tea gcc agg cat gaa cct act tac aca gat ttg etc 1536 
Asn Trp Met Ser Ser Ala Arg His Glu Pro Thr Tyr Thr Asp Leu Leu 

500 505 510 

tec ggc ttt ggg act aac ata gat cca tec cat ggt cag egg ata cct 1584 
Ser Gly Phe Gly Thr Asn lie Asp Pro Ser His Gly Gin Arg lie Pro 
515 520 525 

ttt tat gae cat tea tea tea cct tct atg cct gca aag aga ate ttg 1632 
Phe Tyr Asp His Ser Ser Ser Pro Ser Met Pro Ala Lys Arg lie Leu 
530 535 540 

agt gat tea gaa ggc aag ttc gat tat ctt get aac cag tgg cag atg 1680 
Ser Asp Ser Glu Gly Lys Phe Asp Tyr Leu Ala Asn Gin Trp Gin Met 
545 550 555 560 

ata cac tct ggt etc tec etg aag tta cat gaa tct cct aag gta cct 1728 
He His Ser Gly Leu Ser Leu Lys Leu His Glu Ser Pro Lys Val Pro 
565 570 575 

gca gca act gat gcg tct etc caa ggg cga tgc aat gtt aaa tac age 1776 
Ala Ala Thr Asp Ala Ser Leu Gin Gly Arg Cys Asn Val Lys Tyr Ser 
580 585 590 

gaa tat cct gtt ctt aat ggt eta teg act gag aat get ggt ggt aac 1824 
Glu Tyr Pro Val Leu Asn Gly Leu Ser Thr Glu Asn Ala Gly Gly Asn 
595 600 605 

tgg cca ata cgt cca cgt get ttg aat tat tat gag gaa gtg gte aat 1872 
Trp Pro lie Arg Pro Arg Ala Leu Asn Tyr Tyr Glu Glu Val Val Asn 
610 615 620 

get caa gcg caa get cag get agg gag caa gta aca aaa caa cec ttc 1920 
Ala Gin Ala Gin Ala Gin Ala Arg Glu Gin Val Thr Lys Gin Pro Phe 

625 630 635 640 

aeg ata caa gag gag aca gca aag tea aga gaa ggg aac tgc agg etc 196 8 

Thr He Gin Glu Glu Thr Ala Lys Ser Arg Glu Gly Asn Cys Arg Leu 
645 650 655 

ttt ggc att cct etg ace aac aac atg aat ggg aca gae tea ace atg 2016 
Phe Gly He Pro Leu Thr Asn Asn Met Asn Gly Thr Asp Ser Thr Met 
660 665 670 

tct cag aga aac aac ttg aat gat get gcg ggg ctt aca cag ata gca 2064 
Ser Gin Arg Asn Asn Leu Asn Asp Ala Ala Gly Leu Thr Gin He Ala 
675 680 685 

tea cca aag gtt cag gae ctt tea gat cag tea aaa ggg tea aaa tea 2112 
Ser Pro Lys Val Gin Asp Leu Ser Asp Gin Ser Lys Gly Ser Lys Ser 
690 695 700 

aca aac gat eat cgt gaa cag gga aga cca ttc cag act aat aat cct 2160 
Thr Asn Asp His Arg Glu Gin Gly Arg Pro Phe Gin Thr Asn Asn Pro 
705 710 715 720 

cat ccg aag gat get caa aeg aaa ace aac tea agt agg agt tgc aca 2208 
His Pro Lys Asp Ala Gin Thr Lys Thr Asn Ser Ser Arg Ser Cys Thr 
725 730 735 

aag gtt cac aag cag gga att gca ctt ggc cgt tea gtg gat ctt tea 2256 
Lys Val His Lys Gin Gly He Ala Leu Gly Arg Ser Val Asp Leu Ser 
740 745 750 

aag ttc caa aac tat gag gag tta gte get gag etg gae agg etg ttt 2304 
Lys Phe Gin Asn Tyr Glu Glu Leu Val Ala Glu Leu Asp Arg Leu Phe 

755 760 765 

gag ttc aat gga gag ttg atg get cct aag aaa gat tgg ttg ata gtt 2352 
Glu Phe Asn Gly Glu Leu Met Ala Pro Lys Lys Asp Trp Leu He Val 
770 775 780 



tac aca gat gaa gag aat gat atg atg ctt gtt ggt gae gat cct tgg 
Tyr Thr Asp Glu Glu Asn Asp Met Met Leu Val Gly Asp Asp Pro Trp 
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785 790 795 800 

cag gag ttt tgt tgc atg gtt cgc aaa ate ttc ata tac acg aaa gag 2448 
Gin Glu Phe Cys Cys Met Val Arg Lys lie Phe lie Tyr Thr Lys Glu 
805 810 815 

gaa gtg agg aag atg aac ccg ggg act tta age tgt agg age gag gaa 24 96 

Glu Val Arg Lys Met Asn Pro Gly Thr Leu Ser Cys Arg Ser Glu Glu 
820 825 830 

gaa gca gtt gtt ggg gaa gga tea gat gca aag gac gee aag tct gca 2544 
Glu Ala Val Val Gly Glu Gly Ser Asp Ala Lys Asp Ala Lys Ser Ala 
835 840 845 

tea aat cct tea ttg tee age get ggg aac tet taa 2580 
Ser Asn Pro Ser Leu Ser Ser Ala Gly Asn Ser 
850 855 

<210> 26 
<211> 859 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 26 

Met Ala Ser Ser Glu Val Ser Met Lys Gly Asn Arg Gly Gly Asp Asn 
15 10 15 

Phe Ser Ser Ser Gly Phe Ser Asp Pro Lys Glu Thr Arg Asn Val Ser 
20 25 30 

Val Ala Gly Glu Gly Gin Lys Ser Asn Ser Thr Arg Ser Ala Ala Ala 
35 40 45 

Glu Arg Ala Leu Asp Pro Glu Ala Ala Leu Tyr Arg Glu Leu Trp His 
50 55 60 

Ala Cys Ala Gly Pro Leu Val Thr Val Pro Arg Gin Asp Asp Arg Val 
65 70 75 80 

Phe Tyr Phe Pro Gin Gly His lie Glu Gin Val Glu Ala Ser Thr Asn 
85 90 95 

Gin Ala Ala Glu Gin Gin Met Pro Leu Tyr Asp Leu Pro Ser Lys Leu 
100 105 110 

Leu Cys Arg Val lie Asn Val Asp Leu Lys Ala Glu Ala Asp Thr Asp 
115 120 125 

Glu Val Tyr Ala Gin lie Thr Leu Leu Pro Glu Ala Asn Gin Asp Glu 
130 135 140 

Asn Ala lie Glu Lys Glu Ala Pro Leu Pro Pro Pro Pro Arg Phe Gin 
145 150 155 160 

Val His Ser Phe Cys Lys Thr Leu Thr Ala Ser Asp Thr Ser Thr His 
165 170 175 

Gly Gly Phe Ser Val Leu Arg Arg His Ala Asp Glu Cys Leu Pro Pro 
180 185 190 



Leu Asp Met Ser Arg Gin Pro Pro Thr Gin Glu Leu Val Ala Lys Asp 
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195 200 205 

Leu His Ala Asn Glu Trp Arg Phe Arg His lie Phe Arg Gly Gin Pro 
210 215 220 

Arg Arg His Leu Leu Gin Ser Gly Trp Ser Val Phe Val Ser Ser Lys 
225 230 235 240 

Arg Leu Val Ala Gly Asp Ala Phe lie Phe Leu Arg Gly Glu Asn Gly 
245 250 255 

Glu Leu Arg Val Gly Val Arg Arg Ala Met Arg Gin Gin Gly Asn Val 
260 265 270 

Pro Ser Ser Val lie Ser Ser His Ser Met His Leu Gly Val Leu Ala 
275 280 285 

Thr Ala Trp His Ala lie Ser Thr Gly Thr Met Phe Thr Val Tyr Tyr 
290 295 300 

Lys Pro Arg Thr Ser Pro Ser Glu Phe lie Val Pro Phe Asp Gin Tyr 
305 310 315 320 

Met Glu Ser Val Lys Asn Asn Tyr Ser lie Gly Met Arg Phe Lys Met 
325 330 335 

Arg Phe Glu Gly Glu Glu Ala Pro Glu Gin Arg Phe Thr Gly Thr lie 
340 345 350 

Val Gly lie Glu Glu Ser Asp Pro Thr Arg Trp Pro Lys Ser Lys Trp 
355 360 365 

Arg Ser Leu Lys Val Arg Trp Asp Glu Thr Ser Ser lie Pro Arg Pro 
370 375 380 

Asp Arg Val Ser Pro Trp Lys Val Glu Pro Ala Leu Ala Pro Pro Ala 
385 390 395 400 

Leu Ser Pro Val Pro Met Pro Arg Pro Lys Arg Pro Arg Ser Asn lie 
405 410 415 

Ala Pro Ser Ser Pro Asp Ser Ser Met Leu Thr Arg Glu Gly Thr Thr 
420 425 430 

Lys Ala Asn Met Asp Pro Leu Pro Ala Ser Gly Leu Ser Arg Val Leu 
435 440 445 

Gin Gly Gin Glu Tyr Ser Thr Leu Arg Thr Lys His Thr Glu Ser Val 
450 455 460 

Glu Cys Asp Ala Pro Glu Asn Ser Val Val Trp Gin Ser Ser Ala Asp 
465 470 475 480 

Asp Asp Lys Val Asp Val Val Ser Gly Ser Arg Arg Tyr Gly Ser Glu 
485 490 495 
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Asn Trp Met Ser Ser Ala Arg His Glu Pro Thr Tyr Thr Asp Leu Leu 
500 505 510 

Ser Gly Phe Gly Thr Asn lie Asp Pro Ser His Gly Gin Arg lie Pro 
515 520 525 

Phe Tyr Asp His Ser Ser Ser Pro Ser Met Pro Ala Lys Arg lie Leu 
530 535 540 

Ser Asp Ser Glu Gly Lys Phe Asp Tyr Leu Ala Asn Gin Trp Gin Met 
545 550 555 560 

lie His Ser Gly Leu Ser Leu Lys Leu His Glu Ser Pro Lys Val Pro 
565 570 575 

Ala Ala Thr Asp Ala Ser Leu Gin Gly Arg Cys Asn Val Lys Tyr Ser 
580 585 590 

Glu Tyr Pro Val Leu Asn Gly Leu Ser Thr Glu Asn Ala Gly Gly Asn 
595 600 605 

Trp Pro lie Arg Pro Arg Ala Leu Asn Tyr Tyr Glu Glu Val Val Asn 
610 615 620 

Ala Gin Ala Gin Ala Gin Ala Arg Glu Gin Val Thr Lys Gin Pro Phe 
625 630 635 640 

Thr lie Gin Glu Glu Thr Ala Lys Ser Arg Glu Gly Asn Cys Arg Leu 
645 650 655 

Phe Gly lie Pro Leu Thr Asn Asn Met Asn Gly Thr Asp Ser Thr Met 
660 665 670 

Ser Gin Arg Asn Asn Leu Asn Asp Ala Ala Gly Leu Thr Gin lie Ala 
675 680 685 

Ser Pro Lys Val Gin Asp Leu Ser Asp Gin Ser Lys Gly Ser Lys Ser 
690 695 700 

Thr Asn Asp His Arg Glu Gin Gly Arg Pro Phe Gin Thr Asn Asn Pro 
705 710 715 720 

His Pro Lys Asp Ala Gin Thr Lys Thr Asn Ser Ser Arg Ser Cys Thr 
725 730 735 

Lys Val His Lys Gin Gly He Ala Leu Gly Arg Ser Val Asp Leu Ser 
740 745 750 

Lys Phe Gin Asn Tyr Glu Glu Leu Val Ala Glu Leu Asp Arg Leu Phe 
755 760 765 

Glu Phe Asn Gly Glu Leu Met Ala Pro Lys Lys Asp Trp Leu He Val 
770 775 780 

Tyr Thr Asp Glu Glu Asn Asp Met Met Leu Val Gly Asp Asp Pro Trp 
785 790 795 800 
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Gin Glu Phe Cys Cys Met Val Arg Lys lie Phe He Tyr Thr Lys Glu 
805 810 815 

Glu Val Arg Lys Met Asn Pro Gly Thr Leu Ser Cys Arg Ser Glu Glu 
820 825 830 

Glu Ala Val Val Gly Glu Gly Ser Asp Ala Lys Asp Ala Lys Ser Ala 
835 840 845 

Ser Asn Pro Ser Leu Ser Ser Ala Gly Asn Ser 
850 855 



<210> 


27 


<211> 


1519 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(197) , . (1252) 


<223> 


G615 


<400> 


27 



ttttttcttt tctttctttt tttgctggtg tgagaaattg tacgcttact atctctctct 60 

ctctctgcca gattctctct ttttgatgat gtgaaagttg tgcttttgtt tcttaagaaa 120 

aaggcatatt tttaatactt gattcttggt tcttgattct tgattcttgg ttttttttag 180 

cttcttaagt tcggtg atg teg tct tec acc aat gac tac aac gat ggt aat 232 
Met Ser Ser Ser Thr Asn Asp Tyr Asn Asp Gly Asn 
15 10 

aac aat gga gtg tac cct etc tct ctt tac ctt tct tea etc tct ggc 280 
Asn Asn Gly Val Tyr Pro Leu Ser Leu Tyr Leu Ser Ser Leu Ser Gly 
15 20 25 

cat caa gac ate att cat aat cec tac aac eat eag tta aaa gca tct 328 
His Gin Asp He He His Asn Pro Tyr Asn His Gin Leu Lys Ala Ser 
30 35 40 

ecg ggc cat atg gta tea gca gtt cct gaa tct ctg ate gat tac atg 376 
Pro Gly His Met Val Ser Ala Val Pro Glu Ser Leu He Asp Tyr Met 
45 50 55 60 

gcg ttt aag tea aat aat gtt gtg aat caa caa ggc ttt gag ttt cct 424 
Ala Phe Lys Ser Asn Asn Val Val Asn Gin Gin Gly Phe Glu Phe Pro 
65 70 75 

gag gtg tea aag gaa ate aag aag gtg gtg aag aag gac cga eat age 472 
Glu Val Ser Lys Glu He Lys Lys Val Val Lys Lys Asp Arg His Ser 
80 85 90 

aag att caa acg gca caa ggg att aga gac agg agg gtt agg ctt ttt 520 
Lys He Gin Thr Ala Gin Gly He Arg Asp Arg Arg Val Arg Leu Phe 

95 100 105 

att ggg att get egc caa ttc ttt gat ctt eag gat atg ttg ggg ttt 568 
He Gly He Ala Arg Gin Phe Phe Asp Leu Gin Asp Met Leu Gly Phe 
110 115 120 

gat aaa get agt aaa acg tta gac tgg ctg etc aag aag tea aga aaa 616 
Asp Lys Ala Ser Lys Thr Leu Asp Trp Leu Leu Lys Lys Ser Arg Lys 
125 130 135 140 

gee ate aaa gag gtc gta caa gca aaa aac etc aac aat gat gat gaa 664 
Ala lie Lys Glu Val Val Gin Ala Lys Asn Leu Asn Asn Asp Asp Glu 
145 150 155 
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gat ttt gga aac att gga ggc gat gta gaa caa gaa gag gag aag gag 712 
Asp Phe Gly Asn lie Gly Gly Asp Val Glu Gin Glu Glu Glu hys Glu 
160 165 170 

gag gat gac aat ggc gat aag age ttc gtg tat ggt ttg age ccc ggg 760 
Glu Asp Asp Asn Gly Asp Lys Ser Phe Val Tyr Gly Leu Ser Pro Gly 
175 180 185 

tac ggt gaa gaa gaa gtg gta tgt gag gcc acg aag gca ggg ata aga 808 
Tyr Gly Glu Glu Glu Val Val Cys Glu Ala Thr Lys Ala Gly lie Arg 
190 195 200 

aag aag aag agt gag ttg aga aac ate tea tea aag ggg eta gga gcc 856 
Lys Lys Lys Ser Glu Leu Arg Asn lie Ser Ser Lys Gly Leu Gly Ala 
205 210 215 220 

aaa get aga gga aaa gea aag gag ega aea aaa gag atg atg gcc tat 904 
Lys Ala Arg Gly Lys Ala Lys Glu Arg Thr Lys Glu Met Met Ala Tyr 

225 230 235 

gat aat cea gag act gee tct gat att aca caa tet gaa ate atg gae 952 
Asp Asn Pro Glu Thr Ala Ser Asp He Thr Gin Ser Glu He Met Asp 
240 245 250 

cea tte aag agg tct ata gtc ttc aat gaa gga gaa gat atg aea cac 1000 
Pro Phe Lys Arg Ser He Val Phe Asn Glu Gly Glu Asp Met Thr His 
255 260 265 

ctt ttc tac aag gaa cea ate gag gag ttt gat aat caa gaa tet ate 1048 
Leu Phe Tyr Lys Glu Pro He Glu Glu Phe Asp Asn Gin Glu Ser He 
270 275 280 

tta acc aat atg act eta cea acg aag atg ggt caa agt tac aat caa 1096 
Leu Thr Asn Met Thr Leu Pro Thr Lys Met Gly Gin Ser Tyr Asn Gin 
285 290 295 300 

aat aat ggg ata ctt atg ttg gta gat cag agt tct age age aac tat 1144 
Asn Asn Gly He Leu Met Leu Val Asp Gin Ser Ser Ser Ser Asn Tyr 
305 310 315 

aat aca ttt ctg cct caa aat ttg gat tat agt tat gat caa aac cet 1192 
Asn Thr Phe Leu Pro Gin Asn Leu Asp Tyr Ser Tyr Asp Gin Asn Pro 
320 325 330 

ttt cat gac caa acc tta tat gta gtc acc gac aaa aat ttc ccc aaa 1240 
Phe His Asp Gin Thr Leu Tyr Val Val Thr Asp Lys Asn Phe Pro Lys 

335 340 345 

ggt tte eta taa atctcgacag ttttgaagga etatgeatga teaagtttaa 1292 
Gly Phe Leu 

350 

acatgtaagc caatatagtc cettattcet ctgaatgtat acaaaatcta tagttatgta 1352 

tatctgttcc tttttaacgt atctttattg atcttetgtg cettgatcaa aattgteatt 1412 

ttaagattea gtttgtgtaa tattttaget acaaetttta agtggtatta ttgtaacctt 1472 

ttgaactata tattttgaag atgaataaga aeatgtttat ataaaaa 1519 

<210> 28 
<211> 351 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 28 

Met Ser Ser Ser Thr Asn Asp Tyr Asn Asp Gly Asn Asn Asn Gly Val 
15 10 15 

Tyr Pro Leu Ser Leu Tyr Leu Ser Ser Leu Ser Gly His Gin Asp lie 
20 25 30 
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lie His Asn Pro Tyr Asn His Gin Leu Lys Ala Ser Pro Gly His Met 
35 40 45 

Val Ser Ala Val Pro Glu Ser Leu lie Asp Tyr Met Ala Phe Lys Ser 
50 55 60 

Asn Asn Val Val Asn Gin Gin Gly Phe Glu Phe Pro Glu Val Ser Lys 
65 70 75 80 

Glu lie Lys Lys Val Val Lys Lys Asp Arg His Ser Lys lie Gin Thr 
85 90 95 

Ala Gin Gly lie Arg Asp Arg Arg Val Arg Leu Phe lie Gly lie Ala 
100 105 110 

Arg Gin Phe Phe Asp Leu Gin Asp Met Leu Gly Phe Asp Lys Ala Ser 
115 120 125 

Lys Thr Leu Asp Trp Leu Leu Lys Lys Ser Arg Lys Ala lie Lys Glu 
130 135 140 

Val Val Gin Ala Lys Asn Leu Asn Asn Asp Asp Glu Asp Phe Gly Asn 
145 150 .155 160 

lie Gly Gly Asp Val Glu Gin Glu Glu Glu Lys Glu Glu Asp Asp Asn 
165 170 175 

Gly Asp Lys Ser Phe Val Tyr Gly Leu Ser Pro Gly Tyr Gly Glu Glu 
180 185 190 

Glu Val Val Cys Glu Ala Thr Lys Ala Gly lie Arg Lys Lys Lys Ser 
195 200 205 

Glu Leu Arg Asn lie Ser Ser Lys Gly Leu Gly Ala Lys Ala Arg Gly 
210 215 220 

Lys Ala Lys Glu Arg Thr Lys Glu Met Met Ala Tyr Asp Asn Pro Glu 
225 230 235 240 

Thr Ala Ser Asp lie Thr Gin Ser Glu lie Met Asp Pro Phe Lys Arg 
245 250 255 

Ser lie Val Phe Asn Glu Gly Glu Asp Met Thr His Leu Phe Tyr Lys 
260 265 270 

Glu Pro lie Glu Glu Phe Asp Asn Gin Glu Ser lie Leu Thr Asn Met 
275 280 285 

Thr Leu Pro Thr Lys Met Gly Gin Ser Tyr Asn Gin Asn Asn Gly lie 
290 295 300 

Leu Met Leu Val Asp Gin Ser Ser Ser Ser Asn Tyr Asn Thr Phe Leu 
305 310 315 320 

Pro Gin Asn Leu Asp Tyr Ser Tyr Asp Gin Asn Pro Phe His Asp Gin 
325 330 335 

Page 4 8 



wo 01/36444 



PCT/USOO/31325 



MBI0018 Sequence Listing. ST25 

Thr Leu Tyr Val Val Thr Asp Lys Asn Phe Pro Lys Gly Phe Leu 
340 345 350 



<210> 


29 


<211> 


974 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(62) - . (874) 


<223> 


G1073 


<400> 


29 



ccccccgacc tgcctctaca gagacctgaa gattccagaa ccccacctga tcaaaaataa 60 

c atg gaa ctt aac aga tct gaa gca gac gaa gca aag gcc gag acc act 109 
Met Glu Leu Asn Arg Ser Glu Ala Asp Glu.Ala Lys Ala Glu Thr Thr 
15 10 15 

ccc acc ggt gga gcc acc age tea gcc aca gcc tct ggc tct tec tec 157 
Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

gga cgt cgt cca cgt ggt cgt cct gca ggt tec aaa aac aaa ccc aaa 205 
Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

cct ccg acg att ata act aga gat agt cct aac gtc ctt aga tea cac 253 
Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

gtt ctt gaa gtc acc tec ggt teg gac ata tec gag gca gtc tec acc 301 
Val Leu Glu Val Thr Ser Gly Ser Asp lie Ser Glu Ala Val Ser Thr 
65 70 75 80 

tac gcc act cgt cgc ggc tgc ggc gtt tgc att ata age ggc acg ggt 349 
Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys lie lie Ser Gly Thr Gly 
85 90 95 

gcg gtc act aac gtc acg ata egg caa cct gcg get ccg get ggt gga 397 
Ala Val Thr Asn Val Thr lie Arg Gin Pro Ala Ala Pro Ala Gly Gly 

ICQ 105 110 

ggt gtg att acc ctg cat ggt egg ttt gac att ttg tct ttg acc ggt 445 
Gly Val lie Thr Leu His Gly Arg Phe Asp lie Leu Ser Leu Thr Gly 
115 120 125 

act gcg ctt cca ccg cct gca cca ccg gga gca gga ggt ttg acg gtg 493 
Thr Ala Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 

tat eta gcc gga ggt caa gga caa gtt gta gga ggg aat gtg get ggt 541 
Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

teg tta att get teg gga ccg gta gtg ttg atg get get tct ttt gca 589 
Ser Leu lie Ala Ser Gly Pro Val Val Leu Met Ala Ala Ser Phe Ala 
165 170 175 

aac gca gtt tat gat agg tta ccg att gaa gag gaa gaa acc cca ccg 637 
Asn Ala Val Tyr Asp Arg Leu Pro lie Glu Glu Glu Glu Thr Pro Pro 
180 185 190 

ccg aga acc acc ggg gtg cag cag cag cag ccg gag gcg tct cag teg 685 
Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 

teg gag gtt acg ggg agt ggg gcc cag gcg tgt gag tea aac etc caa 733 
Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 220 
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ggt gga aat ggt gga gga ggt gtt get ttc tac aat ctt gga atg aat 781 
Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Asn Leu Gly Met Asn 
225 230 235 240 

atg aac aat ttt caa ttc tec ggg gga gat att tac ggt atg age ggc 829 
Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 
245 250 255 

ggt age gga gga ggt ggt ggc ggt gcg act aga ccc gcg ttt tag 874 
Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 

agttttagcg ttttggtgac accttttgtt gcgtttgcgt gtttgacctc aaactactag 934 

gctactagct atagcggttg cgaaatgcga atattaggtt 974 

<210> 30 
<211> 270 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 30 

Met Glu Leu Asn Arg Ser Glu Ala Asp Glu Ala Lys Ala Glu Thr Thr 
15 10 15 

Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

Val Leu Glu Val Thr Ser Gly Ser Asp lie Ser Glu Ala Val Ser Thr 
65 70 75 80 

Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys lie lie Ser Gly Thr Gly 
85 90 95 

Ala Val Thr Asn Val Thr lie Arg Gin Pro Ala Ala Pro Ala Gly Gly 
100 105 110 

Gly Val lie Thr Leu His Gly Arg Phe Asp lie Leu Ser Leu Thr Gly 
115 120 125 

Thr Ala Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 

Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

Ser Leu He Ala Ser Gly Pro Val Val Leu Met Ala Ala Ser Phe Ala 
165 170 175 

Asn Ala Val Tyr Asp Arg Leu Pro He Glu Glu Glu Glu Thr Pro Pro 
180 185 190 

Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 
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Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 220 

Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Asn Leu Gly Met Asn 
225 230 235 240 

Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 
245 250 255 

Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 



<210> 


31 


<211> 


2010 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2010) 


<223> 


G1493 


<400> 


31 



atg atg aat ccg agt cac gga aga gga etc gga teg get ggt ggg tec 48 
Met Met Asn Pro Ser His Gly Arg Gly Leu Gly Ser Ala Gly Gly Ser 
15 10 15 

age tee ggt aga aat caa gga ggt ggt ggt gag ace gtc gtc gag atg 96 
Ser Ser Gly Arg Asn Gin Gly Gly Gly Gly Glu Thr Val Val Glu Met 
20 25 30 

ttt cct tct ggt ctt cga gtt ctt gtc gtt gac gat gac cca act tgt 144 
Phe Pro Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys 
35 40 45 

etc atg ate tta gag agg atg ctt agg act tgt ctt tac gaa gta acg 192 
Leu Met lie Leu Glu Arg Met Leu Arg Thr Cys Leu Tyr Glu Val Thr 
50 55 60 

aaa tgc aac aga gca gag atg gca ttg tct ctg etc egg aag aac aaa 24 0 

Lys Cys Asn Arg Ala Glu Met Ala Leu Ser Leu Leu Arg Lys Asn Lys 
65 70 75 80 

cat gga ttc gat ata gta ate agt gat gtt cat atg cct gac atg gac 288 
His Gly Phe Asp lie Val He Ser Asp Val His Met Pro Asp Met Asp 
85 90 95 

ggt ttc aag ctt ctt gag cat gtt ggt eta gag atg gac tta cct gtt 336 
Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val 
100 105 110 

ate atg atg tct gcg gat gat tea aag agt gtg gtt eta aag gga gta 384 
lie Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val 
115 120 125 

acg cac ggt gcg gtt gat tac ctt ate aag cct gta cgt atg gag gca 432 
Thr His Gly Ala Val Asp Tyr Leu He Lys Pro Val Arg Met Glu Ala 
130 135 140 

ctt aag aac ata tgg eag cat gta gtt agg aag agg aga agt gaa tgg 480 
Leu Lys Asn He Trp Gin His Val Val Arg Lys Arg Arg Ser Glu Trp 
145 150 155 160 

agt gta ccg gaa cat tct ggg age att gag gag act ggc gag aga eag 528 
Ser Val Pro Glu His Ser Gly Ser He Glu Glu Thr Gly Glu Arg Gin 
165 170 175 

cag cag caa cat aga gga ggt ggt ggt ggt gca get gtt tct ggt gga 576 
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Gin Gin Gin His Arg Gly Gly Gly Gly Gly Ala Ala Val Ser Gly Gly 
180 185 190 

gag gat gcg gtg gat gat aac tea tec teg gtt aac gaa ggt aae aat 624 
Glu Asp Ala Val Asp Asp Asn Ser Ser Ser Val Asn Glu Gly Asn Asn 
195 200 205 

tgg agg age agt tea egg aag agg aaa gae gag gaa gga gaa gag caa 672 
Trp Arg Ser Ser Ser Arg Lys Arg Lys Asp Glu Glu Gly Glu Glu Gin 
210 215 220 

gga gat gat aag gac gaa gat gcg teg aat ttg aag aaa ccg cgt gtc 720 
Gly Asp Asp Lys Asp Glu Asp Ala Ser Asn Leu Lys Lys Pro Arg Val 
225 230 235 240 

gtc tgg tct gtt gaa ttg cat cag eag ttt gtt get get gtt aat cag 768 
Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
245 250 255 

etc ggc gtt gag aag gcg gtt cct aaa aag ate tta gag ctg atg aat 816 
Leu Gly Val Glu Lys Ala Val Pro Lys Lys lie Leu Glu Leu Met Asn 
260 265 270 

gtt cct ggt eta acc ega gaa aac gta gca agt cac etc cag aaa tac 864 
Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys Tyr 
275 280 285 

egg ata tat eta aga egg ctt gga ggg gta teg cag cac caa ggc aat 912 
Arg lie Tyr Leu Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn 
290 295 300 

ctt aac aac teg ttt atg acg ggt eag gat gcg age tte gga cct ctt 960 
Leu Asn Asn Ser Phe Met Thr Gly Gin Asp Ala Ser Phe Gly Pro Leu 
305 310 315 320 

teg aca ttg aat ggg ttt gat ctt caa gca eta gee gtc aca ggt cag 1008 
Ser Thr Leu Asn Gly Phe Asp Leu Gin Ala Leu Ala Val Thr Gly Gin 
325 330 335 

tta cct gca cag agt ctt gca eag ctt caa gee get ggt tta ggc egg 1056 
Leu Pro Ala Gin Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg 
340 345 350 

cct gcg atg gtc tct aag tea ggt ttg ccg gtt tec tec att gtg gat 1104 
Pro Ala Met Val Ser Lys Ser Gly Leu Pro Val Ser Ser lie Val Asp 
355 360 365 

gag aga age ate tte age ttt gac aac acg aaa aca aga ttt gga gaa 1152 
Glu Arg Ser lie Phe Ser Phe Asp Asn Thr Lys Thr Arg Phe Gly Glu 
370 375 380 

ggg ctt ggg eat cac ggg caa caa ccc caa cag caa cca cag atg aac 1200 
Gly Leu Gly His His Gly Gin Gin Pro Gin Gin Gin Pro Gin Met Asn 
385 390 395 400 

tta ctt cac ggt gtc ccc acg ggt tta caa cag cag ctt cct atg ggt 1248 
Leu Leu His Gly Val Pro Thr Gly Leu Gin Gin Gin Leu Pro Met Gly 
405 410 415 

aat ega atg agt att caa caa eag att get get' gtt ega get gga aat 1296 
Asn Arg Met Ser lie Gin Gin Gin lie Ala Ala Val Arg Ala Gly Asn 
420 425 430 

agt gtt caa aac aac gga atg ctg atg cct eta gcg ggt cag cag tct 1344 
Ser Val Gin Asn Asn Gly Met Leu Met Pro Leu Ala Gly Gin Gin Ser 

435 440 445 

ttg cct egg gga cca ccg cct atg eta ace tct teg caa tea tec ate 13 92 

Leu Pro Arg Gly Pro Pro Pro Met Leu Thr Ser Ser Gin Ser Ser lie 
450 455 460 



agg eag ccg atg tta tea aae egc att tec gag aga agt ggt tte tct 
Arg Gin Pro Met Leu Ser Asn Arg lie Ser Glu Arg Ser Gly Phe Ser 
465 470 475 480 



1440 
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gga agg aac aat ate ccc gag age age aga gtg tta ccg aea agt tac 1488 
Gly Arg Asn Asn lie Pro Glu Ser Ser Arg Val Leu Pro Thr Ser Tyr 
485 490 495 

act aat etc aca aea caa cac tea tea age teg atg cct tat aac aac 1536 
Thr Asn Leu Thr Thr Gin His Ser Ser Ser Ser Met Pro Tyr Asn Asn 
500 505 510 

ttc caa cca gaa ctt ccc gtg aac agt ttc ccg ctg gca agt gca cca 1584 
Phe Gin Pro Glu Leu Pro Val Asn Ser Phe Pro Leu Ala Ser Ala Pro 
515 520 525 

999 ata tea gta ccg gtt egg aaa gcc act tct tac cag gaa gag gtt 1632 
Gly lie Ser Val Pro Val Arg Lys Ala Thr Ser Tyr Gin Glu Glu Val 
530 535 540 



aac age tec gaa gcg ggt ttc att aeg ccg age tac gae atg ttc ace 
Asn Ser Ser Glu Ala Gly Phe lie Thr Pro Ser Tyr Asp Met Phe Thr 
545 550 555 560 



1680 



ace aga cag aat gat tgg gat ctg agg aat att gga ata gcc ttt gae 1728 
Thr Arg Gin Asn Asp Trp Asp Leu Arg Asn lie Gly lie Ala Phe Asp 
565 570 575 

tea cat cag gae tea gaa tee get geg ttt tec get tea gaa gee tac 1776 
Ser His Gin Asp Ser Glu Ser Ala Ala Phe Ser Ala Ser Glu Ala Tyr 
580 585 590 

tct tct teg tec atg tea aga cac aac aeg aca gtt gca gee ace gag 1824 
Ser Ser Ser Ser Met Ser Arg His Asn Thr Thr Val Ala Ala Thr Glu 
595 600 605 

eat gge ega aae eae eag cag eea eca teg gga atg gta cag eae eat 1872 
His Gly Arg Asn His Gin Glri Pro Pro Ser Gly Met Val Gin His His 
610 615 620 

eag gtt tat gea gae gga aae ggt ggt tea gtg agg gtg aaa tea gag 1920 
Gin Val Tyr Ala Asp Gly Asn Gly Gly Ser Val Arg Val Lys Ser Glu 
625 630 635 640 

aga gtg get aeg gat aca gca aca atg gcg ttt cac gag cag tat agt 1968 
Arg Val Ala Thr Asp Thr Ala Thr Met Ala Phe His Glu Gin Tyr Ser 
645 650 655 

aat caa gaa gat ctt atg age gca ctt ctt aag cag gtt tga 2010 
Asn Gin Glu Asp Leu Met Ser Ala Leu Leu Lys Gin Val 
660 665 

<210> 32 
<211> 669 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 32 

Met Met Asn Pro Ser His Gly Arg Gly Leu Gly Ser Ala Gly Gly Ser 
15 10 15 

Ser Ser Gly Arg Asn Gin Gly Gly Gly Gly Glu Thr Val Val Glu Met 
20 25 30 

Phe Pro Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys 
35 40 45 

Leu Met lie Leu Glu Arg Met Leu Arg Thr Cys Leu Tyr Glu Val Thr 
50 55 60 

Lys Cys Asn Arg Ala Glu Met Ala Leu Ser Leu Leu Arg Lys Asn Lys 
65 70 75 80 
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His Gly Phe Asp lie Val lie Ser Asp Val His Met Pro Asp Met Asp 
85 90 95 

Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val 
100 105 110 

lie Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val 
115 120 125 

Thr His Gly Ala Val Asp Tyr Leu lie Lys Pro Val Arg Met Glu Ala 
130 135 140 

Leu Lys Asn lie Trp Gin His Val Val Arg Lys Arg Arg Ser Glu Trp 
145 150 155 160 

Ser Val Pro Glu His Ser Gly Ser lie Glu Glu Thr Gly Glu Arg Gin 
165 170 175 

Gin Gin Gin His Arg Gly Gly Gly Gly Gly Ala Ala Val Ser Gly Gly 
180 185 190 

Glu Asp Ala Val Asp Asp Asn Ser Ser Ser Val Asn Glu Gly Asn Asn 
195 200 205 

Trp Arg Ser Ser Ser Arg Lys Arg Lys Asp Glu Glu Gly Glu Glu Gin 
210 215 220 

Gly Asp Asp Lys Asp Glu Asp Ala Ser Asn Leu Lys Lys Pro Arg Val 
225 230 235 240 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
245 250 255 

Leu Gly Val Glu Lys Ala Val Pro Lys Lys lie Leu Glu Leu Met Asn 
260 265 270 

Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys Tyr 
275 280 285 

Arg lie Tyr Leu Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn 
290 295 300 

Leu Asn Asn Ser Phe Met Thr Gly Gin Asp Ala Ser Phe Gly Pro Leu 
305 310 315 320 

Ser Thr Leu Asn Gly Phe Asp Leu Gin Ala Leu Ala Val Thr Gly Gin 
325 330 335 

Leu Pro Ala Gin Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg 
340 345 350 

Pro Ala Met Val Ser Lys Ser Gly Leu Pro Val Ser Ser lie Val Asp 
355 360 365 

Glu Arg Ser lie Phe Ser Phe Asp Asn Thr Lys Thr Arg Phe Gly Glu 
370 375 380 
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Gly Leu Gly His His Gly Gin Gin Pro Gin Gin Gin Pro Gin Met Asn 
385 390 395 400 

Leu Leu His Gly Val Pro Thr Gly Leu Gin Gin Gin Leu Pro Met Gly 
405 410 415 

Asn Arg Met Ser lie Gin Gin Gin lie Ala Ala Val Arg Ala Gly Asn 
420 425 430 

Ser Val Gin Asn Asn Gly Met Leu Met Pro Leu Ala Gly Gin Gin Ser 
435 440 445 

Leu Pro Arg Gly Pro Pro Pro Met Leu Thr Ser Ser Gin Ser Ser lie 
450 455 460 

Arg Gin Pro Met Leu Ser Asn Arg lie Ser Glu Arg Ser Gly Phe Ser 
465 470 475 480 

Gly Arg Asn Asn lie Pro Glu Ser Ser Arg Val Leu Pro Thr Ser Tyr 
485 490 495 

Thr Asn Leu Thr Thr Gin His Ser Ser Ser Ser Met Pro Tyr Asn Asn 
500 505 510 

Phe Gin Pro Glu Leu Pro Val Asn Ser Phe Pro Leu Ala Ser Ala Pro 
515 520 525 

Gly lie Ser Val Pro Val Arg Lys Ala Thr Ser Tyr Gin Glu Glu Val 
530 535 540 

Asn Ser Ser Glu Ala Gly Phe lie Thr Pro Ser Tyr Asp Met Phe Thr 
545 550 555 560 

Thr Arg Gin Asn Asp Trp Asp Leu Arg Asn lie Gly lie Ala Phe Asp 
565 570 575 

Ser His Gin Asp Ser Glu Ser Ala Ala Phe Ser Ala Ser Glu Ala Tyr 
580 585 590 

Ser Ser Ser Ser Met Ser Arg His Asn Thr Thr Val Ala Ala Thr Glu 
595 600 605 

His Gly Arg Asn His Gin Gin Pro Pro Ser Gly Met Val Gin His His 
610 615 620 

Gin Val Tyr Ala Asp Gly Asn Gly Gly Ser Val Arg Val Lys Ser Glu 
625 630 635 640 

Arg Val Ala Thr Asp Thr Ala Thr Met Ala Phe His Glu Gin Tyr Ser 
645 650 655 

Asn Gin Glu Asp Leu Met Ser Ala Leu Leu Lys Gin Val 

660 665 

<210> 33 

Page 55 



wo 01/36444 



PCT/USOO/31325 



MBI0018 Sequence Listing. ST25 

<211> 1239 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (6) . . (1091) 

<223> G993 



<400> 33 

caaat atg gaa tac age tgt gta gac gac agt agt aca acg tea gaa tct 

Met Glu Tyr Ser Cys Val Asp Asp Ser Ser Thr Thr Ser Glu Ser 
15 10 15 



50 



etc tec ate tct act act cca aag ccg aca acg acg acg gag aag aaa 98 
Leu Ser lie Ser Thr Thr Pro Lys Pro Thr Thr Thr Thr Glu Lys Lys 
20 25 30 

etc tct tct ccg ccg gcg acg teg atg cgt etc tac aga atg gga age 146 
Leu Ser Ser Pro Pro Ala Thr Ser Met Arg Leu Tyr Arg Met Gly Ser 
35 40 45 

ggc gga age age gte gtt ttg gat tea gag aac gge gtc gag ace gag 194 
Gly Gly Ser Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Thr Glu 
50 55 60 

tea cgt aag ctt cet teg teg aaa tat aaa ggc gtt gtg cct eag cet 242 
Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro 
65 70 75 



aac gga aga tgg gga get eag att tac gag aag eat eag ega gtt tgg 
Asn Gly Arg Trp Gly Ala Gin lie Tyr Glu Lys His Gin Arg Val Trp 
80 85 90 95 



gtt ttg att aac ttg gaa gat aga aca ggg aaa gtg tgg egg tte cgt 
Val Leu lie Asn Leu Glu Asp Arg Thr Gly Lys Val Trp Arg Phe Arg 
240 245 250 255 



290 



etc ggt act tte aac gag gaa gaa gaa get gcg tct tct tac gac ate 338 
Leu Gly Thr Phe Asn Glu Glu Glu Glu Ala Ala Ser Ser Tyr Asp lie 
100 105 110 

gee gtg agg aga tte egc ggc cgc gac gee gte act aae tte aaa tct 386 
Ala Val Arg Arg Phe Arg Gly Arg Asp Ala Val Thr Asn Phe Lys Ser 
115 120 125 

caa gtt gat gga aae gac gee gaa teg get ttt ett gac get eat tet 434 
Gin Val Asp Gly Asn Asp Ala Glu Ser Ala Phe Leu Asp Ala His Ser 
130 135 140 

aaa get gag ate gtg gat atg ttg agg aaa cac act tac gcc gat gag 482 
Lys Ala Glu lie Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu 
145 150 155 

ttt gag eag agt aga egg aag ttt gtt aac ggc gac gga aaa cgc tct 530 
Phe Glu Gin Ser Arg Arg Lys Phe Val Asn Gly Asp Gly Lys Arg Ser 
160 165 170 175 

ggg ttg gag acg gcg acg tac gga aac gac get gtt ttg aga gcg egt 578 
Gly Leu Glu Thr Ala Thr Tyr Gly Asn Asp Ala Val Leu Arg Ala Arg 
180 185 190 

gag gtt ttg tte gag aag act gtt aeg ccg age gac gte ggg aag etg 626 
Glu Val Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu 
195 200 205 

aae egt tta gtg ata ceg aaa caa eae gcg gag aag eat ttt ccg tta 674 
Asn Arg Leu Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu 

210 215 220 

ceg gcg atg aeg aeg gcg atg ggg atg aat ccg tct ccg acg aaa gge 722 
Pro Ala Met Thr Thr Ala Met Gly Met Asn Pro Ser Pro Thr Lys Gly 

225 230 235 



770 
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tac agt tac tgg aac age agt caa agt tac gtg ttg acc aag ggc tgg 818 
Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp 
260 265 270 



age egg ttc gtt aaa gag aag aat ctt cga gcc ggt gat gtg gtt tgt 
Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val Cys 
275 280 285 



866 



ttc gag aga tea aec gga cca gae egg caa ttg tat ate cac tgg aaa 914 
Phe Glu Arg Ser Thr Gly Pro Asp Arg Gin Leu Tyr lie His Trp Lys 
290 295 300 

gtc egg tct agt ccg gtt cag act gtg gtt agg eta ttc gga gtc aac 962 
Val Arg Ser Ser Pro Val Gin Thr Val Val Arg Leu Phe Gly Val Asn 
305 310 315 

att ttc aat gtg agt aac gag aaa cca aac gae gtc gea gta gag tgt 1010 
lie Phe Asn Val Ser Asn Glu Lys Pro Asn Asp Val Ala Val Glu Cys 

320 325 330 335 

gtt ggc aag aag aga tct egg gaa gat gat ttg ttt teg tta ggg tgt 1058 
Val Gly Lys Lys Arg Ser Arg Glu Asp Asp Leu Phe Ser Leu Gly Cys 
340 345 350 

tec aag aag cag geg att ate aac ate ttg tga caaattcttt ttttttggtt 1111 
Ser Lys Lys Gin Ala lie lie Asn He Leu 
355 360 

tttttcttea atttgtttct cetttttcaa tattttgtat tgaaatgaca agttgtaaat 1171 

taggacaaga caagaaaaaa tgacaactag acaaaatagt ttttgtttaa aaaaaaaaaa 1231 

aaaaaaaa 123 9 

<210> 34 
<211> 361 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 34 

Met Glu Tyr Ser Cys Val Asp Asp Ser Ser Thr Thr Ser Glu Ser Leu 
15 10 15 

Ser He Ser Thr Thr Pro Lys Pro Thr Thr Thr Thr Glu Lys Lys Leu 
20 25 30 

Ser Ser Pro Pro Ala Thr Ser Met Arg Leu Tyr Arg Met Gly Ser Gly 
35 40 45 

Gly Ser Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Thr Glu Ser 
50 55 60 

Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro Asn 
65 70 75 80 

Gly Arg Trp Gly Ala Gin He Tyr Glu Lys His Gin Arg Val Trp Leu 
85 90 95 

Gly Thr Phe Asn Glu Glu Glu Glu Ala Ala Ser Ser Tyr Asp He Ala 
100 105 110 

Val Arg Arg Phe Arg Gly Arg Asp Ala Val Thr Asn Phe Lys Ser Gin 
115 120 125 



Val Asp Gly Asn Asp Ala Glu Ser Ala Phe Leu Asp Ala His Ser Lys 
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130 135 140 

Ala Glu lie Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Phe 
145 150 155 160 

Glu Gin Ser Arg Arg Lys Phe Val Asn Gly Asp Gly Lys Arg Ser Gly 
165 170 175 

Leu Glu Thr Ala Thr Tyr Gly Asn Asp Ala Val Leu Arg Ala Arg Glu 
180 185 190 

Val Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn 
195 200 205 

Arg Leu Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu Pro 
210 215 220 

Ala Met Thr Thr Ala Met Gly Met Asn Pro Ser Pro Thr Lys Gly Val 
225 230 235 240 

Leu lie Asn Leu Glu Asp Arg Thr Gly Lys Val Trp Arg Phe Arg Tyr 
245 250 255 

Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp Ser 
260 265 270 

Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val Cys Phe 
275 280 285 

Glu Arg Ser. Thr Gly Pro Asp Arg Gin Leu Tyr lie His Trp Lys Val 
290 295 300 

Arg Ser Ser Pro Val Gin Thr Val Val Arg Leu Phe Gly Val Asn lie 
305 310 315 320 

Phe Asn Val Ser Asn Glu Lys Pro Asn Asp Val Ala Val Glu Cys Val 
325 330 335 

Gly Lys Lys Arg Ser Arg Glu Asp Asp Leu Phe Ser Leu Gly Cys Ser 
340 345 350 

Lys Lys Gin Ala lie lie Asn lie Leu 

360 





355 


<210> 


35 


<211> 


1281 


<212> 


DNA 


<213> 


Arabidopsis thai i ana 


<220> 




<221> 


CDS 


<222> 


(64) . . (1098) 


<223> 


G867 


<400> 


35 



cacaacacaa acacatttct gttttctcca ttgtttcaaa ccataaaaaa aaacacagat 60 

taa atg gaa teg agt age gtt gat gag agt act aca agt aca ggt tec 108 
Met Glu Ser Ser Ser Val Asp Glu Ser Thr Thr Ser Thr Gly Ser 
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ate tgt gaa acc ccg gcg ata act ccg gcg aaa aag teg teg gta ggt 156 
lie Cys Glu Thr Pro Ala He Thr Pro Ala Lys Lys Ser Ser Val Gly 
20 25 30 

aac tta tac agg atg gga age gga tea age gtt gtg tta gat tea gag 204 
Asn Leu Tyr Arg Met Gly Ser Gly Ser Ser Val Val Leu Asp Ser Glu 
35 40 45 

aac gge gta gaa get gaa tet agg aag ctt ceg teg tea aaa tae aaa 252 
Asn Gly Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys 
50 55 60 

99t gtg gtg cca eaa eca aac gga aga tgg gga get eag att tac gag 300 
Gly Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He Tyr Glu 

65 70 75 

aaa cac eag cgc gtg tgg etc ggg aca ttc aac gaa gaa gac gaa gcc 348 
Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala 
80 85 90 95 

get cgt gee tae gac gtc gcg gtt cac agg ttc cgt cgc cgt gac gee 396 
Ala Arg Ala Tyr Asp Val Ala Val His Arg Phe Arg Arg Arg Asp Ala 
100 105 110 

gtc aea aat tte aaa gac gtg aag atg gac gaa gac gag gtc gat ttc 444 
Val Thr Asn Phe Lys Asp Val Lys Met Asp Glu Asp Glu Val Asp Phe 
115 120 125 

ttg aat tet cat teg aaa tet gag ate gtt gat atg ttg agg aaa cat 492 
Leu Asn Ser His Ser Lys Ser Glu He Val Asp Met Leu Arg Lys His 
130 135 140 

act tat aac gaa gag tta gag cag agt aaa egg cgt cgt aat ggt aac 540 
Thr Tyr Asn Glu Glu Leu Glu Gin Ser Lys Arg Arg Arg Asn Gly Asn 
145 150 155 

gga aac atg act agg acg ttg tta acg teg ggg ttg agt aat gat ggt 588 
Gly Asn Met Thr Arg Thr Leu Leu Thr Ser Gly Leu Ser Asn Asp Gly 
160 165 170 175 

gtt tet aeg acg ggg ttt aga teg gcg gag gca etg ttt gag aaa gcg 636 
Val Ser Thr Thr Gly Phe Arg Ser Ala Glu Ala Leu Phe Glu Lys Ala 
180 185 190 

gta aeg eca age gac gtt ggg aag eta aac cgt ttg gtt ata eeg aaa 684 
Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val He Pro Lys 
195 200 205 

eat cac gea gag aaa cat ttt ccg tta ccg tea agt aac gtt tec gtg 732 
His His Ala Glu Lys His Phe Pro Leu Pro Ser Ser Asn Val Ser Val 

210 215 220 

aaa gga gtg ttg ttg aac ttt gag gac gtt aac ggg aaa gtg tgg agg 780 
Lys Gly Val Leu Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg 
225 230 235 

tte cgt tac teg tat tgg aac agt agt cag agt tat gtt ttg act aaa 828 
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys 
240 245 250 255 

ggt tgg age agg ttc gtt aag gag aag aat eta cgt get ggt gac gtg 876 
Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val 
260 265 270 

gtt agt ttc agt aga tet aac ggt cag gat eaa cag ttg tac att ggg 924 
Val Ser Phe Ser Arg Ser Asn Gly Gin Asp Gin Gin Leu Tyr He Gly 
275 280 285 

tgg aag teg aga tec ggg tea gat tta gat gcg ggt egg gtt ttg aga 972 
Trp Lys Ser Arg Ser Gly Ser Asp Leu Asp Ala Gly Arg Val Leu Arg 
290 295 300 

ttg ttc gga gtt aac att tea eeg gag agt tea aga aac gae gtc gta 1020 
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Leu Phe Gly Val Asn lie Ser Pro Glu Ser Ser Arg Asn Asp Val Val 

305 310 315 

gga aac aaa aga gtg aac gat act gag atg tta teg ttg gtg tgt age 1068 
Gly Asn Lys Arg Val Asn Asp Thr Glu Met Leu Ser Leu Val Cys Ser 
320 325 330 335 

aag aag caa cgc ate ttt cac gcc teg taa caactcttet tetttttttt 1118 
Lys Lys Gin Arg lie Phe His Ala Ser 
340 

tettttgttg ttttaataat ttttaaaaac tceattttcg ttttctttat ttgcatcggt 1178 

ttetttcttc ttgtttacca aaggtteatg agttgttttt gttgtattga tgaaetgtaa 123 8 

attttattta taggataaat tttaaaaaaa aaaaaaaaaa aaa 1281 

<210> 36 

<211> 344 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 36 

Met Glu Ser Ser Ser Val Asp Glu Ser Thr Thr Ser Thr Gly Ser lie 
1.5 10 15 

Cys Glu Thr Pro Ala lie Thr Pro Ala Lys Lys Ser Ser Val Gly Asn 
20 25 30 

Leu Tyr Arg Met Gly Ser Gly Ser Ser Val Val Leu Asp Ser Glu Asn 
35 40 45 

Gly Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly 
50 55 60 

Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He Tyr Glu Lys 
65 70 75 80 

His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala 
85 90 95 

Arg Ala Tyr Asp Val Ala Val His Arg Phe Arg Arg Arg Asp Ala Val 
100 105 110 

Thr Asn Phe Lys Asp Val Lys Met Asp Glu Asp Glu Val Asp Phe Leu 
115 120 125 

Asn Ser His Ser Lys Ser Glu He Val Asp Met Leu Arg Lys His Thr 
130 135 140 

Tyr Asn Glu Glu Leu Glu Gin Ser Lys Arg Arg Arg Asn Gly Asn Gly 
145 150 155 160 

Asn Met Thr Arg Thr Leu Leu Thr Ser Gly Leu Ser Asn Asp Gly Val 
165 170 175 

Ser Thr Thr Gly Phe Arg Ser Ala Glu Ala Leu Phe Glu Lys Ala Val 
180 185 190 

Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val He Pro Lys His 
195 200 205 
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His Ala Glu Lys His Phe Pro Leu Pro Ser Ser Asn Val Ser Val Lys 
210 215 220 

Gly Val Leu Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg Phe 
225 230 235 240 

Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly 
245 250 255 

Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val 
260 265 270 

Ser Phe Ser Arg Ser Asn Gly Gin Asp Gin Gin Leu Tyr lie Gly Trp 
275 280 285 

Lys Ser Arg Ser Gly Ser Asp Leu Asp Ala Gly Arg Val Leu Arg Leu 
290 295 300 

Phe Gly Val Asn lie Ser Pro Glu Ser Ser Arg Asn Asp Val Val Gly 
305 310 315 320 

Asn Lys Arg Val Asn Asp Thr Glu Met Leu Ser Leu Val Cys Ser Lys 
325 330 335 

Lys Gin Arg lie Phe His Ala Ser 
340 



<210> 


37 


<211> 


1155 


<212> 


DNA 


<213> 


Arabidopsis thai i ana 


<220> 




<221> 


CDS 


<222> 


(76) . . (1077) 


<223> 


G1930 


<400> 


37 



attcacatta ctaatctctc aagatttcac aattttcttg tgattttctc tcagtttctt 60 

atttcgtttc ataac atg gat gcc atg agt age gta gac gag age tct aca 111 
Met Asp Ala Met Ser Ser Val Asp Glu Ser Ser Thr 
15 10 

act aca gat tec att ccg gcg aga aag tea teg tct ccg gcg agt tta 159 
Thr Thr Asp Ser lie Pro Ala Arg Lys Ser Ser Ser Pro Ala Ser Leu 
15 20 25 

eta tat aga atg gga age gga aca age gtg gta ctt gat tea gag aae 207 
Leu Tyr Arg Met Gly Ser Gly Thr Ser Val Val Leu Asp Ser Glu Asn 
30 35 40 

ggt gtc gaa gtc gaa gtc gaa gcc gaa tea aga aag ctt cct tct tea 255 
Gly Val Glu Val Glu Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser 
45 50 55 60 

aga ttc aaa ggt gtt gtt cct caa cca aat gga aga tgg gga get cag 303 
Arg Phe Lys Gly Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin 
65 70 75 

att tac gag aaa eat eaa cge gtg tgg ctt ggt act tte aae gag gaa 351 
lie Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu 
80 85 90 
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gac gaa gca get cgt get tac gac gtc gcg get eac cgt ttc cgt gge 3 99 

Asp Glu Ala Ala Arg Ala Tyr Asp Val Ala Ala His Arg Phe Arg Gly 

95 100 105 

cgc gat gee gtt aet aat ttc aaa gac acg aeg ttc gaa gaa gag gtt 447 
Arg Asp Ala Val Thr Asn Phe Lys Asp Thr Thr Phe Glu Glu Glu Val 
110 115 120 

gag ttc tta aac gcg eat teg aaa tea gag ate gta gat atg ttg aga 495 
Glu Phe Leu Asn Ala His Ser Lys Ser Glu lie Val Asp Met Leu Arg 
125 130 135 140 

aaa cae aet tae aaa gaa gag tta gae caa agg aaa egt aae egt gae 543 
Lys His Thr Tyr Lys Glu Glu Leu Asp Gin Arg Lys Arg Asn Arg Asp 
145 150 155 

ggt aac gga aaa gag acg acg gcg ttt get ttg get teg atg gtg gtt 591 
Gly Asn Gly Lys Glu Thr Thr Ala Phe Ala Leu Ala Ser Met Val Val 
160 165 170 

atg acg ggg ttt aaa acg gcg gag tta ctg ttt gag aaa acg gta acg 639 
Met Thr Gly Phe Lys Thr Ala Glu Leu Leu Phe Glu Lys Thr Val Thr 
175 180 185 

cca agt gac gtc ggg aaa eta aae cgt tta gtt ata eca aaa cae eaa 687 
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val lie Pro Lys His Gin 
190 195 200 

gcg gag aaa eat ttt eeg tta ceg tta ggt aat aat aae gtc tee gtt 735 
Ala Glu Lys His Phe Pro Leu Pro Leu Gly Asn Asn Asn Val Ser Val 
205 210 215 220 

aaa ggt atg etg ttg aat tte gaa gae gtt aae ggg aaa gtg tgg agg 783 
Lys Gly Met Leu Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg 

225 230 235 

tte egt tac tct tat tgg aat agt agt eaa agt tat gtg ttg ace aaa 831 
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys 
240 245 250 

ggt tgg agt aga ttc gtt aaa gag aag aga ett tgt get ggt gat ttg 879 
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Cys Ala Gly Asp Leu 
255 260 265 

ate agt ttt aaa aga tee aae gat eaa gat eaa aaa tte ttt ate ggg 927 
lie Ser Phe Lys Arg Ser Asn Asp Gin Asp Gin Lys Phe Phe lie Gly 
270 275 280 

tgg aaa teg aaa tec ggg ttg gat eta gag acg ggt egg gtt atg aga 975 
Trp Lys Ser Lys Ser Gly Leu Asp Leu Glu Thr Gly Arg Val Met Arg 
285 290 295 300 

ttg ttt ggg gtt gat att tct tta aac gcc gtc gtt gta gtg aag gaa 1023 
Leu Phe Gly Val Asp lie Ser Leu Asn Ala Val Val Val Val Lys Glu 
305 310 315 

aca aeg gag gtg tta atg teg teg tta agg tgt aag aag eaa cga gtt 1071 
Thr Thr Glu Val Leu Met Ser Ser Leu Arg Cys Lys Lys Gin Arg Val 
320 325 330 

ttg taa taacaattta aeaaettggg aaagaaaaaa aagetttttg attttaattt 1127 
Leu 

etettcaacg ttaatettge tgagatta 1155 

<210> 38 

<211> 333 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 38 
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Met Asp Ala Met Ser Ser Val Asp Glu Ser Ser Thr Thr Thr Asp Ser 
15 10 15 

lie Pro Ala Arg Dys Ser Ser Ser Pro Ala Ser Leu Leu Tyr Arg Met 
20 25 30 

Gly Ser Gly Thr Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Val 
35 40 45 

Glu Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Arg Phe Lys Gly 
50 55 60 

Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin lie Tyr Glu Lys 
65 70 75 80 

His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala 
85 90 95 

Arg Ala Tyr Asp Val Ala Ala His Arg Phe Arg Gly Arg Asp Ala Val 
100 105 110 

Thr Asn Phe Lys Asp Thr Thr Phe Glu Glu Glu Val Glu Phe Leu Asn 
115 120 125 

Ala His Ser Lys Ser Glu lie Val Asp Met Leu Arg Lys His Thr Tyr 
130 135 140 

Lys Glu Glu Leu Asp Gin Arg Lys Arg Asn Arg Asp Gly Asn Gly Lys 
145 150 155 160 

Glu Thr Thr Ala Phe Ala Leu Ala Ser Met Val Val Met Thr Gly Phe 
165 170 175 

Lys Thr Ala Glu Leu Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val 
180 185 190 

Gly Lys Leu Asn Arg Leu Val lie Pro Lys His Gin Ala Glu Lys His 
195 200 205 

Phe Pro Leu Pro Leu Gly Asn Asn Asn Val Ser Val Lys Gly Met Leu 
210 215 220 

Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg Phe Arg Tyr Ser 
225 230 235 240 

Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg 
245 250 255 

Phe Val Lys Glu Lys Arg Leu Cys Ala Gly Asp Leu lie Ser Phe Lys 
260 265 270 

Arg Ser Asn Asp Gin Asp Gin Lys Phe Phe lie Gly Trp Lys Ser Lys 
275 280 285 

Ser Gly Leu Asp Leu Glu Thr Gly Arg Val Met Arg Leu Phe Gly Val 
290 295 300 
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Asp lie Ser Leu Asn Ala Val Val Val Val Lys Glu Thr Thr Glu Val 
305 310 315 320 



Leu Met Ser Ser Leu Arg Cys Lys Lys Gin Arg Val Leu 
325 330 



<210> 


39 


<211> 


1035 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (1035) 


<223> 


G1594 


<400> 


39 



atg tac aat ttc cat teg gcc ggt gat tat tea gat aag teg gtt ctg 48 
Met Tyr Asn Phe His Ser Ala Gly Asp Tyr Ser Asp Lys Ser Val Leu 
15 10 15 

atg atg tea ceg gag agt cte atg ttt cot tec gat tac caa get ttg 96 
Met Met Ser Pro Glu Ser Leu Met Phe Pro Ser Asp Tyr Gin Ala Leu 
20 25 30 

eta tgt tec tec gee ggt gaa aat cgt gtc tct gat gtt ttc gga tec 144 
Leu Cys Ser Ser Ala Gly Glu Asn Arg Val Ser Asp Val Phe Gly Ser 

35 40 45 

gac gag eta etc tea gta gcc gtc tec get ttg teg teg gag gcg get 192 
Asp Glu Leu Leu Ser Val Ala Val Ser Ala Leu Ser Ser Glu Ala Ala 
50 55 60 

teg ate get ccg gag ate cga aga aat gat gat aac gtt tct eta act 240 
Ser lie Ala Pro Glu lie Arg Arg Asn Asp Asp Asn Val Ser Leu Thr 
65 70 75 80 

gtc ate aaa get aaa ate get tgt eat ect teg tat cet ege tta ctt 288 
Val He Lys Ala Lys He Ala Cys His Pro Ser Tyr Pro Arg Leu Leu 
85 90 95 

caa get tac ate gat tgc caa aag gtc gga gca cca ccg gag ata gcg 336 
Gin Ala Tyr lie Asp Cys Gin Lys Val Gly Ala Pro Pro Glu He Ala 
100 105 110 

tgt tta eta gag gag att caa egg gag agt gat gtt tat aag caa gag 384 
Cys Leu Leu Glu Glu He Gin Arg Glu Ser Asp Val Tyr Lys Gin Glu 
115 120 125 

gtt gtt ect tct tct tgc ttt gga get gat cct gag ctt gat gaa ttt 432 
Val Val Pro Ser Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe 
130 135 140 

atg gaa acg tac tgc gat ata tta gtg aaa tac aaa teg gat eta gca 480 
Met Glu Thr Tyr Cys Asp He Leu Val Lys Tyr Lys Ser Asp Leu Ala 
145 150 155 160 

aga ceg ttt gac gag gca acg tgt ttc ttg aac aag att gag atg eag 528 
Arg Pro Phe Asp Glu Ala Thr Cys Phe Leu Asn Lys He Glu Met Gin 
165 170 175 

eta egg aac eta tgt act ggt gtc gag tct gcc agg gga gtt tct ggg 576 
Leu Arg Asn Leu Cys Thr Gly Val Glu Ser Ala Arg Gly Val Ser Gly 
180 185 190 

ggg atg tct cct cat ggg gac aag act att agt cct etc ctg aca aat 624 
Gly Met Ser Pro His Gly Asp Lys Thr He Ser Pro Leu Leu Thr Asn 
195 200 205 

gac aat gga gag gat ggt gta ata tea tct gac gag gaa ctg agt gga 672 
Asp Asn Gly Glu Asp Gly Val He Ser Ser Asp Glu Glu Leu Ser Gly 
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210 215 220 

ggt gat cat gag gta gca gag gat ggg aga caa aga tgt gaa gac egg 720 
Gly Asp His Glu Val Ala Glu Asp Gly Arg Gin Arg Cys Glu Asp Arg 
225 230 235 240 

gac etc aaa gat agg ttg eta cgc aaa ttt gga age cgt att agt act 768 
Asp Leu Lys Asp Arg Leu Leu Arg Lys Phe Gly Ser Arg lie Ser Thr 
245 250 255 

tta aag ctt gag ttc tea aag aag aag aag aaa gga aag tta cca aga 816 
Leu Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg 

260 265 270 

gaa gca aga caa get ctt ctt gat tgg tgg aat etc cat tat aag tgg 864 
Glu Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Leu His Tyr Lys Trp 

275 280 285 

cct tac cct act gaa gga gat aag ata gca tta get gat gca acg ggg 912 
Pro Tyr Pro Thr Glu Gly Asp Lys lie Ala Leu Ala Asp Ala Thr Gly 
290 295 300 

tta gac caa aaa caa ate aac aat tgg ttt ata aac caa agg aaa cgt 960 
Leu Asp Gin Lys Gin lie Asn Asn Trp Phe lie Asn Gin Arg Lys Arg 
305 310 315 320 

eat tgg aag cca tea gag aat atg cct ttc get atg atg gat gat tct 1008 
His Trp Lys Pro Ser Glu Asn Met Pro Phe Ala Met Met Asp Asp Ser 
325 330 335 

agt gga tea ttc ttt ace gag gaa tga 1035 
Ser Gly Ser Phe Phe Thr Glu Glu 
340 

<210> 40 
<211> 344 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 40 

Met Tyr Asn Phe His Ser Ala Gly Asp Tyr Ser Asp Lys Ser Val Leu 
15 10 15 

Met Met Ser Pro Glu Ser Leu Met Phe Pro Ser Asp Tyr Gin Ala Leu 
20 25 30 

Leu Cys Ser Ser Ala Gly Glu Asn Arg Val Ser Asp Val Phe Gly Ser 
35 40 45 

Asp Glu Leu Leu Ser Val Ala Val Ser Ala Leu Ser Ser Glu Ala Ala 
50 55 60 

Ser lie Ala Pro Glu lie Arg Arg Asn Asp Asp Asn Val Ser Leu Thr 
65 70 75 80 

Val lie Lys Ala Lys lie Ala Cys His Pro Ser Tyr Pro Arg Leu Leu 
85 90 95 

Gin Ala Tyr lie Asp Cys Gin Lys Val Gly Ala Pro Pro Glu lie Ala 
100 105 110 

Cys Leu Leu Glu Glu lie Gin Arg Glu Ser Asp Val Tyr Lys Gin Glu 
115 120 125 

Val Val Pro Ser Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe 
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130 135 140 

Met Glu Thr Tyr Cys Asp He Leu Val Lys Tyr Lys Ser Asp Leu Ala 
145 150 155 160 

Arg Pro Phe Asp Glu Ala Thr Cys Phe Leu Asn Lys He Glu Met Gin 
165 170 175 

Leu Arg Asn Leu Cys Thr Gly Val Glu Ser Ala Arg Gly Val Ser Gly 
180 185 190 

Gly Met Ser Pro His Gly Asp Lys Thr lie Ser Pro Leu Leu Thr Asn 
195 200 205 

Asp Asn Gly Glu Asp Gly Val He Ser Ser Asp Glu Glu Leu Ser Gly 
210 215 220 

Gly Asp His Glu Val Ala Glu Asp Gly Arg Gin Arg Cys Glu Asp Arg 
225 230 235 240 

Asp Leu Lys Asp Arg Leu Leu Arg Lys Phe Gly Ser Arg He Ser Thr 
245 250 255 

Leu Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg 
260 265 270 

Glu Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Leu His Tyr Lys Trp 
275 280 285 

Pro Tyr Pro Thr Glu Gly Asp Lys He Ala Leu Ala Asp Ala Thr Gly 
290 295 300 

Leu Asp Gin Lys Gin He Asn Asn Trp Phe He Asn Gin Arg Lys Arg 
305 310 315 320 

His Trp Lys Pro Ser Glu Asn Met Pro Phe Ala Met Met Asp Asp Ser 
325 330 335 

Ser Gly Ser Phe Phe Thr Glu Glu 
340 



<210> 


41 


<211> 


2559 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2259) 


<223> 


G391 


<400> 


41 



atg atg atg gtc cat teg atg age aga gat atg atg aac aga gag teg 4 8 

Met Met Met Val His Ser Met Ser Arg Asp Met Met Asn Arg Glu Ser 
15 10 15 

ccg gat aaa ggg tta gat tec gge aag tat gtg agg tae acg ecg gag 96 
Pro Asp Lys Gly Leu Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu 
20 25 30 
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caa gtg gaa get etc gag aga gtt tac act gag tgt cct aag cca agt 144 
Gin Val Glu Ala Leu Glu Arg Val Tyr Thr Glu Cys Pro Lys Pro Ser 
35 40 45 

tct eta aga aga caa caa etc ata cgt gaa tgt ccg att etc tct aac 192 
Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Pro lie Leu Ser Asn 
50 55 60 

ate gag cct aag cag ate aaa gtt tgg ttt cag aac cgc aga tgt egt 24 0 

lie Glu Pro Lys Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 75 80 

gag aag cag agg aaa gaa get get cgt ctt caa aca gtg aac aga aaa 288 
Glu Lys Gin Arg Lys Glu Ala Ala Arg Leu Gin Thr Val Asn Arg Lys 
85 90 95 

etc aat gcc atg aac aaa etc ttg atg gaa gag aat gat cgt ttg cag 33 6 

Leu Asn Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

aag caa gtt tct aac ttg gtc tat gag aat ggc cac atg aaa cat caa 384 
Lys Gin Val Ser Asn Leu Val Tyr Glu Asn Gly His Met Lys His Gin 
115 120 125 

ctt cac act get tct ggg acg acc aca gac aac age tgt gag tct gtg 432 
Leu His Thr Ala Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val 
130 135 140 

gtc gtg agt ggt cag caa cat caa cag caa aac cca aat cct cag cat 480 
Val Val Ser Gly Gin Gin His Gin Gin Gin Asn Pro Asn Pro Gin His 
145 150 155 160 

cag caa cgt gat get aac aac cca gca gga etc ctt tct ata gea gag 528 
Gin Gin Arg Asp Ala Asn Asn Pro Ala Gly Leu Leu Ser lie Ala Glu 
165 170 175 

gag gee eta gca gag tte ctt tec aag get aca gga act get gtt gac 576 
Glu Ala Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr Ala Val Asp 
180 185 190 

tgg gtt cag atg att ggg atg aag cct ggt ccg gat tct att ggc ata 624 
Trp Val Gin Met lie Gly Met Lys Pro Gly Pro Asp Ser lie Gly lie 
195 200 205 

gtc get att teg cgc aac tgc age gga att gca gca cgt gcc tgc ggc 672 
Val Ala lie Ser Arg Asn Cys Ser Gly lie Ala Ala Arg Ala Cys Gly 
210 215 220 

etc gtg agt tta gaa ccc atg aag gtt get gaa att etc aaa gat cgt 720 
Leu Val Ser Leu Glu Pro Met Lys Val Ala Glu lie Leu Lys Asp Arg 

225 230 235 240 

cca tct tgg etc ega gat tgt cga agt gtg gat act ctg agt gtg ata 768 
Pro Ser Trp Leu Arg Asp Cys Arg Ser Val Asp Thr Leu Ser Val lie 
245 250 255 

cct get gga aac ggt ggg acg ate gag ctt att tac acg cag atg tat 816 
Pro Ala Gly Asn Gly Gly Thr lie Glu Leu lie Tyr Thr Gin Met Tyr 
260 265 270 

get cct acg act tta gca gca get egt gae ttt tgg acg ctg aga tat 864 
Ala Pro Thr Thr Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr 
275 280 285 

age aca tgt ttg gaa gat gga age tat gtg gtt tgt gaa agg teg ctt 912 
Ser Thr Cys Leu Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu 
290 295 300 

act tct gca act ggt ggc ccc act ggg cca cct tct tea aac ttt gtg 960 
Thr Ser Ala Thr Gly Gly Pro Thr Gly Pro Pro Ser Ser Asn Phe Val 
305 310 315 320 

aga get gaa atg aaa eea age ggg ttt etc ate egt cct tgc gat ggt 1008 
Arg Ala Glu Met Lys Pro Ser Gly Phe Leu lie Arg Pro Cys Asp Gly 
325 330 335 
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99t ggt tec att etc cac att gtt gat cat gtt gat ctg gat gcc tgg 
Gly Gly Ser lie Leu His lie Val Asp His Val Asp Leu Asp Ala Trp 
340 345 350 

agt gtc cct gaa gtc atg agg cct etc tat gaa tea teg aag att ctt 
Ser Val Pro Glu Val Met Arg Pro Leu Tyr Glu Ser Ser Lys lie Leu 
355 360 365 

get cag aaa atg act gtt get get ttg aga eat gta aga caa att gca 
Ala Gin Lys Met Thr Val Ala Ala Leu Arg His Val Arg Gin lie Ala 
370 375 380 

caa gaa aca agt gga gaa gtt cag tat ggt gga ggg cge caa cct gcg 
Gin Glu Thr Ser Gly Glu Val Gin Tyr Gly Gly Gly Arg Gin Pro Ala 
385 390 395 400 

gtt tta aga acc ttc agt caa aga etc tgt egg ggt ttc aat gat get 
Val Leu Arg Thr Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala 
405 410 415 

gtt aat ggt ttt gtg gat gat gga tgg tea cea atg ggt age gat ggt 
Val Asn Gly Phe Val Asp Asp Gly Trp Ser Pro Met Gly Ser Asp Gly 
420 425 430 

gca gag gat gtt act gta atg ata aac ttg tec cct ggg aag ttt ggt 
Ala Glu Asp Val Thr Val Met lie Asn Leu Ser Pro Gly Lys Phe Gly 
435 440 445 

ggg tet cag tac ggt aat tea ttc ctt cca age ttt ggt agt ggc gtg 
Gly Ser Gin Tyr Gly Asn Ser Phe Leu Pro Ser Phe Gly Ser Gly Val 

450 455 460 

Ctt tgt gee aag gca tet atg ttg ctt cag aac gtt cca cec get gtg 
Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 
465 470 475 480 

Ctg gtt ega ttc ctt aga gaa cac cga tet gaa tgg get gat tat ggc 
Leu Val Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Tyr Gly 
485 490 495 

gtg gat get tat get get gca teg etc aga gca agt cct ttt get gtt 
Val Asp Ala Tyr Ala Ala Ala Ser Leu Arg Ala Ser Pro Phe Ala Val 
500 505 510 

cct tgt get aga get ggg ggg ttc cca agt aac caa gtc att ctt cct 
Pro Cys Ala Arg Ala Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro 
515 520 525 

ctt gcg cag aca gtt gaa cat gaa gag tea ctt gag gtg gtt aga ctt 
Leu Ala Gin Thr Val Glu His Glu Glu Ser Leu Glu Val Val Arg Leu 
530 535 540 

gaa ggt cac get tac tea cec gaa gac atg ggt tta get egg gat atg 
Glu Gly His Ala Tyr Ser Pro Glu Asp Met Gly Leu Ala Arg Asp Met 
545 550 555 560 

tat ttg eta cag ctt tgt age ggt gtt gat gaa aat gtg gtt gga ggt 
Tyr Leu Leu Gin Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly 
565 570 575 

tgt gca cag ctt gta ttt gcc cct ate gat gaa tea ttt get gat gat 
Cys Ala Gin Leu Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp 

580 585 590 

gca cct ttg ctt cct tec ggt ttc cge ate ata cct ctt gaa cag aaa 
Ala Pro Leu Leu Pro Ser Gly Phe Arg lie He Pro Leu Glu Gin Lys 

595 600 605 

tet act ccg aac ggt gca tet gca aac egt acc ctg gat tta gcc tea 
Ser Thr Pro Asn Gly Ala Ser Ala Asn Arg Thr Leu Asp Leu Ala Ser 
610 615 620 

get tta gaa gga tec aca cgt caa get ggt gaa gee gac cca aat ggc 
Ala Leu Glu Gly Ser Thr Arg Gin Ala Gly Glu Ala Asp Pro Asn Gly 
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1056 
1104 
1152 
1200 
1248 
1296 
1344 
1392 
1440 
1488 
1536 
1584 
1632 
1680 
1728 
1776 
1824 
1872 
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625 630 635 640 

tgt aac ttt agg teg gta eta acc ata gca ttc cag ttc aca ttt gat 1968 
Cys Asn Phe Arg Ser Val Leu Thr lie Ala Phe Gin Phe Thr Phe Asp 
645 650 655 

aac cat tea aga gae agt gtt get tea atg gca cgt cag tac gtg cga 2016 
Asn His Ser Arg Asp Ser Val Ala Ser Met Ala Arg Gin Tyr Val Arg 
660 665 670 

age ata gta gga teg att cag agg gtt get eta gee att get ect cgt 2064 
Ser lie Val Gly Ser lie Gin Arg Val Ala Leu Ala He Ala Pro Arg 
675 680 685 

ect gge tec aat ate agt cea ata tet gtt cec act tee ect gaa get 2112 
Pro Gly Ser Asn He Ser Pro He Ser Val Pro Thr Ser Pro Glu Ala 

690 695 700 

etc act ctg gtc cgt tgg ate tec egg agt tac age ett cae act ggt 2160 
Leu Thr Leu Val Arg Trp He Ser Arg Ser Tyr Ser Leu His Thr Gly 
705 710 715 720 

gca gat etc ttt gga tet gat tet caa acc agt ggt gae aeg ttg ctg 2208 
Ala Asp Leu Phe Gly Ser Asp Ser Gin Thr Ser Gly Asp Thr Leu Leu 
725 730 735 

cat caa etc tgg aat eae tet gat gca ate ttg tgc tgc tec etc aaa 2256 
His Gin Leu Trp Asn His Ser Asp Ala He Leu Cys Cys Ser Leu Lys 
740 745 750 

aca aacgettcac cggttttcae attcgeaaae eaaaccggtt tagaeatgct 2309 
Thr 



ggaaaegaet 


cttgtagcec 


ttcaagaeat 


aatgctagac 


aagaccettg 


acgaacetgg 


2369 


tegtaaagct 


etttgctctg 


agttecceaa 


gateatgeaa 


cagggctatg 


etcatetgee 


2429 


ggeaggagta 


tgtgegteaa 


gcatgggaag 


gatggtatet 


tacgagcagg 


eaacggtgtg 


2489 


gaaagttctt 


gaagaegatg 


aateaaacca 


ctgettagct 


ttcatgtteg 


tgaattggte 


2549 


gttcgtttga 












2559 



<210> 42 
<211> 753 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Met Met Val His Ser Met Ser Arg Asp Met Met Asn Arg Glu Ser 
15 10 15 

Pro Asp Lys Gly Leu Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu 
20 25 30 

Gin Val Glu Ala Leu Glu Arg Val Tyr Thr Glu Cys Pro Lys Pro Ser 
35 40 45 

Ser Leu Arg Arg Gin Gin Leu He Arg Glu Cys Pro He Leu Ser Asn 
50 55 60 

He Glu Pro Lys Gin He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 75 80 

Glu Lys Gin Arg Lys Glu Ala Ala Arg Leu Gin Thr Val Asn Arg Lys 
85 90 95 
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Leu Asn Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

Lys Gin Val Ser Asn Leu Val Tyr Glu Asn Gly His Met Lys His Gin 
115 120 125 

Leu His Thr Ala Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val 
130 135 140 

Val Val Ser Gly Gin Gin His Gin Gin Gin Asn Pro Asn Pro Gin His 
145 150 155 160 

Gin Gin Arg Asp Ala Asn Asn Pro Ala Gly Leu Leu Ser lie Ala Glu 
165 170 175 

Glu Ala Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr Ala Val Asp 
180 185 190 

Trp Val Gin Met lie Gly Met Lys Pro Gly Pro. Asp Ser lie Gly lie 
195 200 205 

Val Ala lie Ser Arg Asn Cys Ser Gly lie Ala Ala Arg Ala Cys Gly 
210 215 220 

Leu Val Ser Leu Glu Pro Met Lys Val Ala Glu lie Leu Lys Asp Arg 
225 230 235 240 

Pro Ser Trp Leu Arg Asp Cys Arg Ser Val Asp Thr Leu Ser Val lie 
245 250 255 

Pro Ala Gly Asn Gly Gly Thr lie Glu Leu lie Tyr Thr Gin Met Tyr 
260 265 270 

Ala Pro Thr Thr Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr 
275 280 285 

Ser Thr Cys Leu Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu 
290 295 300 

Thr Ser Ala Thr Gly Gly Pro Thr Gly Pro Pro Ser Ser Asn Phe Val 
305 310 315 320 

Arg Ala Glu Met Lys Pro Ser Gly Phe Leu lie Arg Pro Cys Asp Gly 
325 330 335 

Gly Gly Ser lie Leu His lie Val Asp His Val Asp Leu Asp Ala Trp 
340 345 350 

Ser Val Pro Glu Val Met Arg Pro Leu Tyr Glu Ser Ser Lys lie Leu 
355 360 365 

Ala Gin Lys Met Thr Val Ala Ala Leu Arg His Val Arg Gin lie Ala 
370 375 380 

Gin Glu Thr Ser Gly Glu Val Gin Tyr Gly Gly Gly Arg Gin Pro Ala 
385 390 395 400 
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Val Leu Arg Thr Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala 
405 410 415 

Val Asn Gly Phe Val Asp Asp Gly Trp Ser Pro Met Gly Ser Asp Gly 
420 425 430 

Ala Glu Asp Val Thr Val Met lie Asn Leu Ser Pro Gly Lys Phe Gly 
435 440 445 

Gly Ser Gin Tyr Gly Asn Ser Phe Leu Pro Ser Phe Gly Ser Gly Val 
450 455 460 

Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 
465 470 475 480 

Leu Val Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Tyr Gly 
485 490 495 

Val Asp Ala Tyr Ala Ala Ala Ser Leu Arg Ala Ser Pro Phe Ala Val 
500 505 510 

Pro Cys Ala Arg Ala Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro 
515 520 525 

Leu Ala Gin Thr Val Glu His Glu Glu Ser Leu Glu Val Val Arg Leu 
530 535 540 

Glu Gly His Ala Tyr Ser Pro Glu Asp Met Gly Leu Ala Arg Asp Met 
545 550 555 560 

Tyr Leu Leu Gin Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly 
565 570 575 

Cys Ala Gin Leu Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp 
580 585 590 

Ala Pro Leu Leu Pro Ser Gly Phe Arg lie lie Pro Leu Glu Gin Lys 
595 600 605 

Ser Thr Pro Asn Gly Ala Ser Ala Asn Arg Thr Leu Asp Leu Ala Ser 
610 615 620 

Ala Leu Glu Gly Ser Thr Arg Gin Ala Gly Glu Ala Asp Pro Asn Gly 
625 630 635 640 

Cys Asn Phe Arg Ser Val Leu Thr lie Ala Phe Gin Phe Thr Phe Asp 
645 650 655 

Asn His Ser Arg Asp Ser Val Ala Ser Met Ala Arg Gin Tyr Val Arg 
660 665 670 

Ser lie Val Gly Ser lie Gin Arg Val Ala Leu Ala lie Ala Pro Arg 
675 680 685 



Pro Gly Ser Asn lie Ser Pro lie Ser Val Pro Thr Ser Pro Glu Ala 
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690 695 700 

Leu Thr Leu Val Arg Trp lie Ser Arg Ser Tyr Ser Leu His Thr Gly 
705 710 715 720 

Ala Asp Leu Phe Gly Ser Asp Ser Gin Thr Ser Gly Asp Thr Leu Leu 
725 730 735 

His Gin Leu Trp Asn His Ser Asp Ala lie Leu Cys Cys Ser Leu Lys 
740 745 750 



Thr 




<210> 


43 


<211> 


2526 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2526) 


<223> 


G3 90 


<400> 


43 



cag ate aaa gtt tgg ttc cag aat cgc aga tgt cga gag aag cag agg 
Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg 
65 70 75 80 



aac aag ctt ttg atg gaa gag aat gat cgt ttg cag aag caa gtc tec 
Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser 
100 105 110 



cag caa cgt cag cag caa aac cca aca cat cag cat cct cag cgt gat 
Gin Gin Arg Gin Gin Gin Asn Pro Thr His Gin His Pro Gin Arg Asp 
145 150 155 160 

gtt aac aac cca get aat ctt etc teg att gcg gag gag ace ttg geg 
Val Asn Asn Pro Ala Asn Leu Leu Ser lie Ala Glu Glu Thr Leu Ala 
165 170 175 



96 



atg atg get cat cac tec atg gac gat aga gae tet cct gat aaa gga 48 
Met Met Ala His His Ser Met Asp Asp Arg Asp Ser Pro Asp Lys Gly 

15 10 15 

ttt gat tec ggc aag tac gtt aga tac aeg ccg gaa caa gtt gaa get 
Phe Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala 
20 25 30 

ctt gag aga gtt tat get gag tgt cct aaa cct age tet ctg aga aga 144 
Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser Ser Leu Arg Arg 
35 40 45 

caa cag ctt att cgt gaa tgt ccc att etc tgt aac ate gag cct cga 192 
Gin Gin Leu lie Arg Glu Cys Pro lie Leu Cys Asn lie Glu Pro Arg 
50 55 60 



240 



aaa gag tea get cgt ctt cag aca gtg aac agg aag ctg agt get atg 288 
Lys Glu Ser Ala Arg Leu Gin Thr Val Asn Arg Lys Leu Ser Ala Met 
85 90 95 



336 



aac ttg gtt tat gag aat gga ttc atg aaa cat cga ate cac act get 384 
Asn Leu Val Tyr Glu Asn Gly Phe Met Lys His Arg lie His Thr Ala 
115 120 125 

tet ggg acg ace aca gac aac age tgt gag tet gtg gtc gtg agt ggt 432 
Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val Val Val Ser Gly 
130 135 140 



480 



528 
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gag ttc ctt tgc aag get aca gga act get gtc gac tgg gtc cag atg 576 
Glu Phe Leu Cys Lys Ala Thr Gly Thr Ala Val Asp Trp Val Gin Met 
180 185 190 

att ggg atg aag cct ggt ccg gat tct att ggt ate gta get gtt tea 624 
lie Gly Met Lys Pro Gly Pro Asp Ser lie Gly lie Val Ala Val Ser 
195 200 205 

cgc aac tgc agt gga ata gca gca cgt gcc tgt ggc etc gtg agt tta 672 
Arg Asn Cys Ser Gly lie Ala Ala Arg Ala Cys Gly Leu Val Ser Leu 
210 215 220 

gaa ccc atg aag gtc get gaa ate etc aaa gat egt cca tct tgg ttc 720 
Glu Pro Met Lys Val Ala Glu lie Leu Lys Asp. Arg Pro Ser Trp Phe 
225 230 235 240 

cgt gac tgt ega tgt gtc gag act etg aat gtt ata ccc act gga aat 768 
Arg Asp Cys Arg Cys Val Glu Thr Leu Asn Val lie Pro Thr Gly Asn 
245 250 255 

ggt ggt act ate gag ctt gtc aae act eag att tat get ect aca aca 816 
Gly Gly Thr lie Glu Leu Val Asn Thr Gin lie Tyr Ala Pro Thr Thr 
260 265 270 

tta gca gea get egt gac ttt tgg aeg etg aga tat agt aca agt eta 864 
Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr Ser Thr Ser Leu 
275 280 285 

gaa gat gga age tat gtg gtc tgt gag aga tea etc act tct gea act 912 
Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu Thr Ser Ala Thr 
290 295 300 

ggt gge ccc aat ggt eea ett tet tea age tte gtg aga gcc aaa atg 960 
Gly Gly Pro Asn Gly Pro Leu Ser Ser Ser Phe Val Arg Ala Lys Met 
305 310 315 320 

etg tea age ggg ttt ett ate egt cet tgt gat ggt ggt ggt tee att 1008 
Leu Ser Ser Gly Phe Leu lie Arg Pro Cys Asp Gly Gly Gly Ser lie 
325 330 335 

att cac ate gtt gat cat gtg gac ttg gat gtc tea agt gtt cct gaa 1056 
lie His lie Val Asp His Val Asp Leu Asp Val Ser Ser Val Pro Glu 
340 345 350 

gtc etc agg ect ctt tat gag tct tec aaa ate ctt get caa aaa atg 1104 
Val Leu Arg Pro Leu Tyr Glu Ser Ser Lys lie Leu Ala Gin Lys Met 

355 360 365 

act gtc get get etg aga cat gtg cgc caa att get caa gag act agt 1152 
Thr Val Ala Ala Leu Arg His Val Arg Gin lie Ala Gin Glu Thr Ser 

370 375 380 

gga gaa gtc eag tat agt ggt gga cgc eag ect gea gtt tta agg act 1200 
Gly Glu Val Gin Tyr Ser Gly Gly Arg Gin Pro Ala Val Leu Arg Thr 
385 390 395 400 

tte age eag aga etc tgc egg ggt tte aat gat get gta aat ggt ttt 1248 
Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala Val Asn Gly Phe 
405 410 415 

gtc gat gat gga tgg tet eea atg agt agt gat gga gga gag gat att 1296 
Val Asp Asp Gly Trp Ser Pro Met Ser Ser Asp Gly Gly Glu Asp lie 
420 425 430 

aeg ate atg att aae tet tec tet get aaa ttt get gge tee caa tac 1344 
Thr lie Met lie Asn Ser Ser Ser Ala Lys Phe Ala Gly Ser Gin Tyr 
435 440 445 

ggt age tea ttt ett eea agt ttt gga agt ggt gtc etc tgt gcc aaa 1392 
Gly Ser Ser Phe Leu Pro Ser Phe Gly Ser Gly Val Leu Cys Ala Lys 
450 455 460 

get tct atg etg ttg cag aat gtt cca ccc ctt gta ttg att egg ttc 1440 
Ala Ser Met Leu Leu Gin Asn Val Pro Pro Leu Val Leu lie Arg Phe 
465 470 475 480 
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ctg aga gaa cac cga get gaa tgg gca gac tat ggt gtc gat gcc tat 14 88 

Leu Arg Glu His Arg Ala Glu Trp Ala Asp Tyr Gly Val Asp Ala Tyr 
485 490 495 

tct get gca tct etc aga gca act cca tat get gtt cca tgc gtc aga 1536 
Ser Ala Ala Ser Leu Arg Ala Thr Pro Tyr Ala Val Pro Cys Val Arg 
500 505 510 

acc ggt ggg ttc ccg agt aac caa gtc att ctt cct etc gca cag aca 1584 
Thr Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro Leu Ala Gin Thr 
515 520 525 

etc gaa cat gaa gag ttt etc gaa gtg gtt aga ctt gga ggt cat get 1632 
Leu Glu His Glu Glu Phe Leu Glu Val Val Arg Leu Gly Gly His Ala 
530 535 540 

tac tea cct gaa gac atg ggc tta tec egg gat atg tat tta ctg cag 1680 
Tyr Ser Pro Glu Asp Met Gly Leu Ser Arg Asp Met Tyr Leu Leu Gin 
545 550 555 560 

ctt tgt age ggc gtt gat gaa aat gtg gtt gga ggt tgt get eag ctt 1728 
Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly Cys Ala Gin Leu 
565 570 575 

gtc ttt gcc cca ate gat gaa tea ttt get gat gat gca cct ttg ctt 1776 
Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp Ala Pro Leu Leu 
580 585 590 

cct tct ggt ttc cgt gtc ata cca etc gac caa aaa aca aat ccg aat 1824 
Pro Ser Gly Phe Arg Val lie Pro Leu Asp Gin Lys Thr Asn Pro Asn 

595 600 605 

gat cat caa tct gca agt cga aca egg gat eta gca teg tec eta gat 1872 
Asp His Gin Ser Ala Ser Arg Thr Arg Asp Leu Ala Ser Ser Leu Asp 

610 615 620 

ggt tec acc aaa acc gat teg gaa aca aac tct aga ttg gtc tta aca 1920 
Gly Ser Thr Lys Thr Asp Ser Glu Thr Asn Ser Arg Leu Val Leu Thr 
625 630 635 640 

ata gcc ttc cag ttc acg ttt gat aac cat tec aga gac aat gtt get 1968 
lie Ala Phe Gin Phe Thr Phe Asp Asn His Ser Arg Asp Asn Val Ala 
645 650 655 

aca atg gcg aga cag tat gtg agg aac gtt gtt ggt teg att cag aga 2016 
Thr Met Ala Arg Gin Tyr Val Arg Asn Val Val Gly Ser lie Gin Arg 
660 665 670 

gtg get eta gcc att acg cct cgt cct ggc tea atg caa ctt cec act 2064 
Val Ala Leu Ala lie Thr Pro Arg Pro Gly Ser Met Gin Leu Pro Thr 
675 680 685 

tec cct gaa get etc act ctt gtc cgt tgg ate acc cgt agt tac agt 2112 
Ser Pro Glu Ala Leu Thr Leu Val Arg Trp lie Thr Arg Ser Tyr Ser 
690 695 700 

att cat aca ggt gca gat ctg ttt gga get gat tct eag tec tgt gga 2160 
lie His Thr Gly Ala Asp Leu Phe Gly Ala Asp Ser Gin Ser Cys Gly 
705 710 715 720 

gga gac aca ttg ctt aag caa etc tgg gac cat agt gat gcc ata ttg 2208 
Gly Asp Thr Leu Leu Lys Gin Leu Trp Asp His Ser Asp Ala lie Leu 

725 730 735 

tgc tgc tec ctg aaa act aat gcc tea ccg gta ttc aca ttt gca aac 2256 
Cys Cys Ser Leu Lys Thr Asn Ala Ser Pro Val Phe Thr Phe Ala Asn 
740 745 750 

caa get ggt tta gac atg ctt gaa act aca ctt gtg gca ctt eag gat 2304 
Gin Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp 
755 760 765 

ata atg etc gac aaa aca ctt gat gac tct ggt cgt aga get ctt tgc 2352 
lie Met Leu Asp Lys Thr Leu Asp Asp Ser Gly Arg Arg Ala Leu Cys 
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770 775 780 

tec gag ttc gcc aag ate atg eag cag gga tat geg aat ett ccg gea 2400 
Ser Glu Phe Ala Lys lie Met Gin Gin Gly Tyr Ala Asn Leu Pro Ala 
785 790 795 800 

gga ata tgt gtg teg age atg ggc aga ccg gtt teg tat gag caa geg 244 8 

Gly lie Cys Val Ser Ser Met Gly Arg Pro Val Ser Tyr Glu Gin Ala 
805 810 815 

acg gtg tgg aaa gtt gtt gat gae aac gaa tea aae cac tgc ttg get 2496 
Thr Val Trp Lys Val Val Asp Asp Asn Glu Ser Asn His Cys Leu Ala 
820 825 830 

ttt ace etc gtt agt tgg teg ttt gtt tga 2526 
Phe Thr Leu Val Ser Trp Ser Phe Val 
835 840 

<210> 44 
<211> 841 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 44 

Met Met Ala His His Ser Met Asp Asp Arg Asp Ser Pro Asp Lys Gly 
15 10 15 

Phe Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala 
20 25 30 

Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser Ser Leu Arg Arg 
35 40 45 

Gin Gin Leu lie Arg Glu Cys Pro lie Leu Cys Asn lie Glu Pro Arg 
50 55 60 

Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg 
65 70 75 80 

Lys Glu Ser Ala Arg Leu Gin Thr Val Asn Arg Lys Leu Ser Ala Met 
85 90 95 

Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser 
100 105 110 

Asn Leu Val Tyr Glu Asn Gly Phe Met Lys His Arg lie His Thr Ala 
115 120 125 

Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val Val Val Ser Gly 
130 135 140 

Gin Gin Arg Gin Gin Gin Asn Pro Thr His Gin His Pro Gin Arg Asp 
145 150 155 160 

Val Asn Asn Pro Ala Asn Leu Leu Ser lie Ala Glu Glu Thr Leu Ala 
165 170 175 

Glu Phe Leu Cys Lys Ala Thr Gly Thr Ala Val Asp Trp Val Gin Met 
180 185 190 



lie Gly Met Lys Pro Gly Pro Asp Ser lie Gly He Val Ala Val Ser 
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195 200 205 

Arg Asn Cys Ser Gly lie Ala Ala Arg Ala Cys Gly Leu Val Ser Leu 
210 215 220 

Glu Pro Met Lys Val Ala Glu He Leu Lys Asp Arg Pro Ser Trp Phe 
225 230 235 240 

Arg Asp Cys Arg Cys Val Glu Thr Leu Asn Val He Pro Thr Gly Asn 
245 250 255 

Gly Gly Thr He Glu Leu Val Asn Thr Gin lie Tyr Ala Pro Thr Thr 
260 265 270 

Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr Ser Thr Ser Leu 
275 280 285 

Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu Thr Ser Ala Thr 
290 295 300 

Gly Gly Pro Asn Gly Pro Leu Ser Ser Ser Phe Val Arg Ala Lys Met 
305 310 315 320 

Leu Ser Ser Gly Phe Leu He Arg Pro Cys Asp Gly Gly Gly Ser He 
325 330 335 

He His He Val Asp His Val Asp Leu Asp Val Ser Ser Val Pro Glu 
340 345 350 

Val Leu Arg Pro Leu Tyr Glu Ser Ser Lys He Leu Ala Gin Lys Met 
355 360 365 

Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala Gin Glu Thr Ser 
370 375 380 

Gly Glu Val Gin Tyr Ser Gly Gly Arg Gin Pro Ala Val Leu Arg Thr 
385 390 395 400 

Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala Val Asn Gly Phe 
405 410 415 

Val Asp Asp Gly Trp Ser Pro Met Ser Ser Asp Gly Gly Glu Asp He 
420 425 430 

Thr He Met He Asn Ser Ser Ser Ala Lys Phe Ala Gly Ser Gin Tyr 
435 440 445 

Gly Ser Ser Phe Leu Pro Ser Phe Gly Ser Gly Val Leu Cys Ala Lys 
450 455 460 

Ala Ser Met Leu Leu Gin Asn Val Pro Pro Leu Val Leu He Arg Phe 
465 470 475 480 

Leu Arg Glu His Arg Ala Glu Trp Ala Asp Tyr Gly Val Asp Ala Tyr 
485 490 495 
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Ser Ala Ala Ser Leu Arg Ala Thr Pro Tyr Ala Val Pro Cys Val Arg 
500 505 510 

Thr Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro Leu Ala Gin Thr 
515 520 525 

Leu Glu His Glu Glu Phe Leu Glu Val Val Arg Leu Gly Gly His Ala 
530 535 540 

Tyr Ser Pro Glu Asp Met Gly Leu Ser Arg Asp Met Tyr Leu Leu Gin 
545 550 555 560 

Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly Cys Ala Gin Leu 
565 570 575 

Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp Ala Pro Leu Leu 
580 585 590 

Pro Ser Gly Phe Arg Val lie Pro Leu Asp Gin Lys Thr Asn Pro Asn 
595 600 605 

Asp His Gin Ser Ala Ser Arg Thr Arg Asp Leu Ala Ser Ser Leu Asp 
610 615 620 

Gly Ser Thr Lys Thr Asp Ser Glu Thr Asn Ser Arg Leu Val Leu Thr 
625 630 635 640 

lie Ala Phe Gin Phe Thr Phe Asp Asn His Ser Arg Asp Asn Val Ala 
645 650 655 

Thr Met Ala Arg Gin Tyr Val Arg Asn Val Val Gly Ser lie Gin Arg 
660 665 670 

Val Ala Leu Ala lie Thr Pro Arg Pro Gly Ser Met Gin Leu Pro Thr 
675 680 685 

Ser Pro Glu Ala Leu Thr Leu Val Arg Trp lie Thr Arg Ser Tyr Ser 
690 695 700 

lie His Thr Gly Ala Asp Leu Phe Gly Ala Asp Ser Gin Ser Cys Gly 
705 710 715 720 

Gly Asp Thr Leu Leu Lys Gin Leu Trp Asp His. Ser Asp Ala lie Leu 
725 730 735 

Cys Cys Ser Leu Lys Thr Asn Ala Ser Pro Val Phe Thr Phe Ala Asn 
740 745 750 

Gin Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp 
755 760 765 

lie Met Leu Asp Lys Thr Leu Asp Asp Ser Gly Arg Arg Ala Leu Cys 
770 775 780 

Ser Glu Phe Ala Lys lie Met Gin Gin Gly Tyr Ala Asn Leu Pro Ala 
785 790 795 800 
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Gly lie Cys Val Ser Ser Met Gly Arg Pro Val Ser Tyr Glu Gin Ala 
805 810 815 

Thr Val Trp Lys Val Val Asp Asp Asn Glu Ser Asn His Cys Leu Ala 
820 825 830 

Phe Thr Leu Val Ser Trp Ser Phe Val 

840 





835 


<210> 


45 


<211> 


2511 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (2511) 


<223> 


G1548 


<400> 


45 



atg gca atg tct tgc aag gat ggt aag ttg gga tgt ttg gat aat ggg 4 8 

Met Ala Met Ser Cys Lys Asp Gly Lys Leu Gly Cys Leu Asp Asn Gly 
15 10 15 

aag tat gtg agg tat aca cct gaa caa gtt gaa gca ctt gag agg ctt 96 
Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala Leu Glu Arg Leu 

20 25 30 

tat cat gac tgt cct aaa ccg agt tct att cgc cgt cag cag ttg ate 144 
Tyr His Asp Cys Pro Lys Pro Ser Ser He Arg Arg Gin Gin Leu He 
35 40 45 

aga gag tgt cct att etc tct aac att gag cct aaa cag ate aaa gtg 192 
Arg Glu Cys Pro He Leu Ser Asn He Glu Pro Lys Gin He Lys Val 
50 55 60 

tgg ttt cag aac cga aga tgt aga gag aaa caa agg aaa gag get tea 24 0 

Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg Lys Glu Ala Ser 
65 70 75 80 

egg ctt caa get gtg aat egg aag ttg acg gca atg aac aag etc ttg 288 
Arg Leu Gin Ala Val Asn Arg Lys Leu Thr Ala Met Asn Lys Leu Leu 
85 90 95 

atg gag gag aat gac agg ttg cag aag caa gtg tea cag etg gtc eat 336 
Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser Gin Leu Val His 
100 105 110 

gaa aac age tac tte cgt caa cat act cca aat cct tea etc cca get 384 
Glu Asn Ser Tyr Phe Arg Gin His Thr Pro Asn Pro Ser Leu Pro Ala 
115 120 125 

aaa gac aca age tgt gaa teg gtg gtg acg agt ggt cag cac caa ttg 432 
Lys Asp Thr Ser Cys Glu Ser Val Val Thr Ser Gly Gin His Gin Leu 
130 135 140 

gca tct caa aat cct cag aga gat get agt cct gca gga ctt ttg tee 480 
Ala Ser Gin Asn Pro Gin Arg Asp Ala Ser Pro Ala Gly Leu Leu Ser 
145 150 155 160 

att gca gaa gaa act tta gca gag ttt ctt tea aag gca act gga ace 528 
He Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 
165 170 175 

get gtt gag tgg gtt cag atg cct gga atg aag cct ggt ccg gat tee 576 
Ala Val Glu Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 190 

att gga ate ate get att tct cat ggt tgc act ggt gtg gca gca cgc 624 
He Gly He He Ala He Ser His Gly Cys Thr Gly Val Ala Ala Arg 
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gcc tgt ggc eta gtg ggt ctt gag cct aca agg gtt gca gag att gtc 672 
Ala Cys Gly Leu Val Gly Leu Glu Pro Thr Arg Val Ala Glu lie Val 
210 215 220 

aag gat cgt cct teg tgg ttc cgc gaa tgt cga get gtt gaa gtt atg 720 
Lys Asp Arg Pro Ser Trp Phe Arg Glu Cys Arg Ala Val Glu Val Met 
225 230 235 240 

aac gtg ttg cca act gcc aat ggt gga acc gtt gag ctg ctt tat atg 768 
Asn Val Leu Pro Thr Ala Asn Gly Gly Thr Val Glu Leu Leu Tyr Met 

245 250 255 

cag etc tat gca cca act aca ttg gcc cca cca cgc gat ttc tgg ctg 816 
Gin Leu Tyr Ala Pro Thr Thr Leu Ala Pro Pro Arg Asp Phe Trp Leu 
260 265 270 

tta cgt tac acc tct gtt tta gaa gat ggc age ctt gtg gtg tgc gag 864 
Leu Arg Tyr Thr Ser Val Leu Glu Asp Gly Ser Leu Val Val Cys Glu 
275 280 285 

aga tct ctt aag age act caa aat ggt cct agt atg cca ctg gtt cag 912 
Arg Ser Leu Lys Ser Thr Gin Asn Gly Pro Ser Met Pro Leu Val Gin 
290 295 300 

aat ttt gtg aga gca gag atg ctt tec agt ggg tac ttg ata egg cct 960 
Asn Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu He Arg Pro 
305 310 315 320 

tgt gat ggt ggt ggc tea ate ata cac ata gtg gat cat atg gat ttg 1008 
Cys Asp Gly Gly Gly Ser lie He His He Val Asp His Met Asp Leu 
325 330 335 

gag get tgt age gtg cct gag gtc ttg cgc eeg etc tat gag tea ccc 1056 
Glu Ala Cys Ser Val Pro Glu Val Leu Arg Pro Leu Tyr Glu Ser Pro 
340 345 350 

aaa gta ctt gca cag aag aca aca atg gcg gca ctg cgt cag etc aag 1104 
Lys Val Leu Ala Gin Lys Thr Thr Met Ala Ala Leu Arg Gin Leu Lys 
355 360 365 

caa ata get cag gag gtt act cag act aat agt agt gtt aat ggg tgg 1152 
Gin He Ala Gin Glu Val Thr Gin Thr Asn Ser Ser Val Asn Gly Trp 
370 375 380 

gga egg cgt cct get gee tta aga get etc age cag agg eta age aga 1200 
Gly Arg Arg Pro Ala Ala Leu Arg Ala Leu Ser Gin Arg Leu Ser Arg 

385 390 395 400 

ggc ttc aat gaa get gta aat ggt ttc act gat gaa gga tgg tea gtg 1248 
Gly Phe Asn Glu Ala Val Asn Gly Phe Thr Asp Glu Gly Trp Ser Val 
405 410 415 

ata gga gat age atg gat gat gtc aca ate act gta aac tct tct cca 1296 
He Gly Asp Ser Met Asp Asp Val Thr He Thr Val Asn Ser Ser Pro 
420 425 430 

gac aag eta atg ggt eta aat ctt aca ttt gcc aat ggc ttt get cct 1344 
Asp Lys Leu Met Gly Leu Asn Leu Thr Phe Ala Asn Gly Phe Ala Pro 
435 440 445 

gta age aat gtt gtt tta tgc gca aaa gca tea atg ctt tta cag aat 1392 
Val Ser Asn Val Val Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn 
450 455 460 

gtt cct eeg gcg ate ctg ctt egg ttt ctg agg gag cat agg tea gaa 1440 
Val Pro Pro Ala He Leu Leu Arg Phe Leu Arg Glu His Arg Ser Glu 
465 470 475 480 

t99 get gac aac aac att gat gcg tat eta gca gca gca gtt aaa gta 1488 
Trp Ala Asp Asn Asn He Asp Ala Tyr Leu Ala Ala Ala Val Lys Val 
485 490 495 

ggg cct tgt agt gee cga gtt gga gga ttt gga ggg cag gtt ata ctt 1536 
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Gly Pro Cys Ser Ala Arg Val Gly Gly Phe Gly Gly Gin Val lie Leu 

500 505 510 

cca ctt get cat act att gag cat gaa gag ttt atg gaa gtc ate aaa 1584 
Pro Leu Ala His Thr lie Glu His Glu Glu Phe Met Glu Val lie Lys 
515 520 525 

ttg gaa ggt ctt ggt cat tec cet gaa gat gca ate gtt cca aga gat 1632 
Leu Glu Gly Leu Gly His Ser Pro Glu Asp Ala lie Val Pro Arg Asp 
530 535 540 

ate ttc ctt ctt caa ctt tgt age gga atg gat gaa aat get gta gga 1680 
lie Phe Leu Leu Gin Leu Cys Ser Gly Met Asp Glu Asn Ala Val Gly 
545 550 555 560 

ace tgt geg gaa ett ata ttt get eca ate gat get teg ttt geg gat 1728 
Thr Cys Ala Glu Leu lie Phe Ala Pro lie Asp Ala Ser Phe Ala Asp 
565 570 575 

gat gca cct ctg ctt ect tet ggt ttt egt att ate cct ctt gat tec 1776 
Asp Ala Pro Leu Leu Pro Ser Gly Phe Arg lie lie Pro Leu Asp Ser 
580 585 590 

gca aag gaa gta tct age cca aac cga acc ttg gat ctt get teg gca 1824 
Ala Lys Glu Val Ser Ser Pro Asn Arg Thr Leu Asp Leu Ala Ser Ala 
595 600 605 

ctg gaa att ggt tea get gga aca aaa gee tea act gat caa tea gga 1872 
Leu Glu lie Gly Ser Ala Gly Thr Lys Ala Ser Thr Asp Gin Ser Gly 
610 615 620 

aac tec aca tgt gca aga tct gtg atg aca ata gca ttt gag ttt ggt 1920 
Asn Ser Thr Cys Ala Arg Ser Val Met Thr lie Ala Phe Glu Phe Gly 

625 630 635 640 

ate gag age cat atg caa gaa cat gta gca tee atg get agg eag tat 1968 
lie Glu Ser His Met Gin Glu His Val Ala Ser Met Ala Arg Gin Tyr 
645 650 655 

gtt cga ggt ate ata tea teg gtg eag aga gta gca ttg get ctt tct 2016 
Val Arg Gly lie lie Ser Ser Val Gin Arg Val Ala Leu Ala Leu Ser 
660 665 670 

cct tet cat ate age tea caa gtt ggt eta cgc act cet ttg ggt act 2064 
Pro Ser His lie Ser Ser Gin Val Gly Leu Arg Thr Pro Leu Gly Thr 
675 680 685 

cct gaa gee caa aca ctt get egt tgg att tgc cag agt tac agg ggc 2112 
Pro Glu Ala Gin Thr Leu Ala Arg Trp lie Cys Gin Ser Tyr Arg Gly 
690 695 700 

tac atg ggt gtt gag eta ctt aaa tea aac agt gac ggc aat gaa tct 2160 
Tyr Met Gly Val Glu Leu Leu Lys Ser Asn Ser Asp Gly Asn Glu Ser 
705 710 715 720 

att ctt aag aat ctt tgg cat cae act gat get ata ate tgc tgc tea 2208 
lie Leu Lys Asn Leu Trp His His Thr Asp Ala lie lie Cys Cys Ser 
725 730 735 

atg aag gee ttg ecc gtc ttc aca ttt gca aac cag geg gga ett gac 2256 
Met Lys Ala Leu Pro Val Phe Thr Phe Ala Asn Gin Ala Gly Leu Asp 
740 745 750 

atg ctg gag act aca tta gtt get ett caa gac ate tct tta gag aag 2304 
Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp lie Ser Leu Glu Lys 

755 760 765 

ata ttt gat gac aat gga aga aag act ctt tgc tct gag ttc eca cag 2352 
lie Phe Asp Asp Asn Gly Arg Lys Thr Leu Cys Ser Glu Phe Pro Gin 
770 775 780 

ate atg caa cag ggc ttc geg tgc ctt caa ggc ggg ata tgt etc tea 2400 
lie Met Gin Gin Gly Phe Ala Cys Leu Gin Gly Gly lie Cys Leu Ser 
785 790 795 800 
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age atg ggg aga cca gtt teg tat gag aga gca gtt get tgg aaa gta 2448 
Ser Met Gly Arg Pro Val Ser Tyr Glu Arg Ala Val Ala Trp Lys Val 
805 810 815 

etc aat gaa gaa gaa aat get cat tgc ate tgc ttt gtg ttc ate aat 2496 
Leu Asn Glu Glu Glu Asn Ala His Cys lie Cys Phe Val Phe lie Asn 
820 825 830 

tgg tec ttt gtg tga 2511 
Trp Ser Phe Val 
835 

<210> 46 
<211> 836 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 46 

Met Ala Met Ser Cys Lys Asp Gly Lys Leu Gly Cys Leu Asp Asn Gly 
15 10 15 

Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala Leu Glu Arg Leu 
20 25 30 

Tyr His Asp Cys Pro Lys Pro Ser Ser lie Arg Arg Gin Gin Leu lie 
35 40 45 

Arg Glu Cys Pro lie Leu Ser Asn lie Glu Pro Lys Gin lie Lys Val 
50 55 60 

Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg Lys Glu Ala Ser 
65 70 75 80 

Arg Leu Gin Ala Val Asn Arg Lys Leu Thr Ala Met Asn Lys Leu Leu 
85 90 95 

Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser Gin Leu Val His 
100 105 110 

Glu Asn Ser Tyr Phe Arg Gin His Thr Pro Asn Pro Ser Leu Pro Ala 
115 120 125 

Lys Asp Thr Ser Cys Glu Ser Val Val Thr Ser Gly Gin His Gin Leu 
130 135 140 

Ala Ser Gin Asn Pro Gin Arg Asp Ala Ser Pro Ala Gly Leu Leu Ser 
145 150 155 160 

lie Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 
165 170 175 

Ala Val Glu Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 190 

lie Gly lie lie Ala lie Ser His Gly Cys Thr Gly Val Ala Ala Arg 
195 200 205 

Ala Cys Gly Leu Val Gly Leu Glu Pro Thr Arg Val Ala Glu lie Val 
210 215 220 
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Lys Asp Arg Pro Ser Trp Phe Arg Glu Cys Arg Ala Val Glu Val Met 
225 230 235 240 

Asn Val Leu Pro Thr Ala Asn Gly Gly Thr Val Glu Leu Leu Tyr Met 
245 250 255 

Gin Leu Tyr Ala Pro Thr Thr Leu Ala Pro Pro Arg Asp Phe Trp Leu 
260 265 270 

Leu Arg Tyr Thr Ser Val Leu Glu Asp Gly Ser Leu Val Val Cys Glu 
275 280 285 

Arg Ser Leu Lys Ser Thr Gin Asn Gly Pro Ser Met Pro Leu Val Gin 
290 295 300 

Asn Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu lie Arg Pro 
305 310 315 320 

Cys Asp Gly Gly Gly Ser He He His He Val Asp His Met Asp Leu 
325 330 335 

Glu Ala Cys Ser Val Pro Glu Val Leu Arg Pro Leu Tyr Glu Ser Pro 
340 345 350 

Lys Val Leu Ala Gin Lys Thr Thr Met Ala Ala Leu Arg Gin Leu Lys 
355 360 365 

Gin He Ala Gin Glu Val Thr Gin Thr Asn Ser Ser Val Asn Gly Trp 
370 375 380 

Gly Arg Arg Pro Ala Ala Leu Arg Ala Leu Ser Gin Arg Leu Ser Arg 
385 390 395 400 

Gly Phe Asn Glu Ala Val Asn Gly Phe Thr Asp Glu Gly Trp Ser Val 
405 410 415 

He Gly Asp Ser Met Asp Asp Val Thr He Thr Val Asn Ser Ser Pro 
420 425 430 

Asp Lys Leu Met Gly Leu Asn Leu Thr Phe Ala Asn Gly Phe Ala Pro 
435 440 445 

Val Ser Asn Val Val Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn 
450 455 460 

Val Pro Pro Ala He Leu Leu Arg Phe Leu Arg Glu His Arg Ser Glu 
465 470 475 480 

Trp Ala Asp Asn Asn He Asp Ala Tyr Leu Ala Ala Ala Val Lys Val 
485 490 495 

Gly Pro Cys Ser Ala Arg Val Gly Gly Phe Gly Gly Gin Val He Leu 
500 505 510 

Pro Leu Ala His Thr He Glu His Glu Glu Phe Met Glu Val He Lys 
515 520 525 
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Leu Glu Gly Leu Gly His Ser Pro Glu Asp Ala lie Val Pro Arg Asp 
530 535 540 

lie Phe Leu Leu Gin Leu Cys Ser Gly Met Asp Glu Asn Ala Val Gly 
545 550 555 560 

Thr Cys Ala Glu Leu lie Phe Ala Pro lie Asp Ala Ser Phe Ala Asp 
565 570 575 

Asp Ala Pro Leu Leu Pro Ser Gly Phe Arg He He Pro Leu Asp Ser 
580 585 590 

Ala Lys Glu Val Ser Ser Pro Asn Arg Thr Leu Asp Leu Ala Ser Ala 
595 600 605 

Leu Glu lie Gly Ser Ala Gly Thr Lys Ala Ser Thr Asp Gin Ser Gly 
610 615 620 

Asn Ser Thr Cys Ala Arg Ser Val Met Thr He Ala Phe Glu Phe Gly 
625 630 635 640 

He Glu Ser His Met Gin Glu His Val Ala Ser Met Ala Arg Gin Tyr 
645 650 655 

Val Arg Gly He He Ser Ser Val Gin Arg Val Ala Leu Ala Leu Ser 
660 665 670 

Pro Ser His He Ser Ser Gin Val Gly Leu Arg Thr Pro Leu Gly Thr 
675 680 685 

Pro Glu Ala Gin Thr Leu Ala Arg Trp He Cys Gin Ser Tyr Arg Gly 
690 695 700 

Tyr Met Gly Val Glu Leu Leu Lys Ser Asn Ser Asp Gly Asn Glu Ser 
705 710 715 720 

He Leu Lys Asn Leu Trp His His Thr Asp Ala He He Cys Cys Ser 
725 730 735 

Met Lys Ala Leu Pro Val Phe Thr Phe Ala Asn Gin Ala Gly Leu Asp 
740 745 750 

Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Ser Leu Glu Lys 
755 760 765 

He Phe Asp Asp Asn Gly Arg Lys Thr Leu Cys Ser Glu Phe Pro Gin 
770 775 780 

He Met Gin Gin Gly Phe Ala Cys Leu Gin Gly Gly He Cys Leu Ser 
785 790 795 800 

Ser Met Gly Arg Pro Val Ser Tyr Glu Arg Ala Val Ala Trp Lys Val 
805 810 815 



Leu Asn Glu Glu Glu Asn Ala His Cys He Cys Phe Val Phe He Asn 
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820 825 830 



Trp Ser Phe Val 
835 



Page 84 



INTERNATIONAL SEARCH REPORT 



1 laiional application No. 
l»CnvUSOO/3i325 



A. ClASSll ICATION OI SUBJICCl MAI IKR 

I!*(:(7) : C:07n 2l/04; C\2U 5/10, 15/29, 15/63, 15/82 

US C:L : 435/320.1, 419, 440, 468; 536/23.1, 23.6; 800/278, 290 

Accord in to Iiitcniatio nal Patent Classification (W C) or to bot h national c lassification and 

n. 1 ]1:M)S SKAKCIIKD 



Mininiuni docunientaiion searcticd (classificalion system followed by classification syn)bols) 
U.S. : 435/320.1, 419, 440, 468; 536/23.1, 23.6; 800/278. 290 



DwuHRMUalion searched other than niininmin documentation to the extent that such documeius are included in the fields searched 



I 'lcctronic daia base consulted during the international searcli (name of data base and, where practicable, search terms used) 
IMease Sec Conlinuaiion Sheet 



C. 



nOC UMKNTS CONSIDICRKO TO BK R^:i.KVAN'J 



(.ategory 



(jtati on of document^ with indication, where appropriate , of (he relevant passages 



Database (Jenbank on NC^Bl, US National I jbrar>' of Medicine (Iknhesda, Ml), USA). 
No. AJ005196, iiUCniiOia, CI. el al. 'Nuclear-iocali/ed receiver-like proteins are 
differenlially expressed in Arabidopsis ihaliana'. Sepleniber 4, 1998. 

SAKAl, I !. el a(. 'IVxvc^Mnponent response regulators from Arabidopsis l ha liana coiUain a 
putative DNA binding motif. Plant Cell Physiology 1998, Vol 39 No. 11, pages 1232- 
1239, see entire document. 

(i]X)V]vR, H..1. el al. Development of sewral epidermal cell types can be specified by the 
same MYB-related plant transcription factor. Development 1998, Vol 125, pages 3497- 
3508, see entire document. 



MARTIN, (-. el al. MYU transcription factors in plants. 
1997, Vol 13, No 2, pages 67-73, see entire document. 



Trends in Ceneiics, I 'ebruary 



lielevant to claim No. 



4,6,9,10 

1-3, 5, 7, 8,9,13, 27 27 
4,6 

1-10, 13, 25 27 
MO, 13, 25-27 



I'uriher dtKuments are listed in the ctnitiimation of Box (' 



□ 



See patent family annex. 



* Sjviiai tatep.^H its o! titvd ckKuiiiciils: 

"A" (l<icittiiL-iil (kTitiiii{? ihv fviifful staK' ttf ibv ail which is not coiKiiltTcJ I(> Ih: 

"1 " fjif Iki applkaiidii (H jmIl-ih pittjjislit'd tm or at'lfi Itic inlcinatuuiat litiuj; (talc 

' I " (l(K.niiK'ii( wtiich iii;i> Uiiow (Jimbis on ptioiii) cl,niii(s) oi wlikli is ciIl-J iv 
esiai>ti<.h ilie putUkaium tlaic ot aiKidici tiiatkm oi oilier sixjcial fcascu (as 
sinrcilK'tl) 

"'()" docuiiKui rctcuiu? to an (>ral clisclfjsiire. use, cx.liit)itic»ii or otlter means 

• p" (lotuiiifid piililjstic'il piior to ilic iiiiei national /"ihnp <laic l)nl later lliaii the 

(II ioi (ly dale claiiiiird 



"I" 



'■X" 



later docinneiii jnililistied after the internatioDal UUu^ tl-Ue oi piiotH) 
date atH\ iioi in coitnici with the a|if>ricali(>n but cite<t to iindcr stand the 
piinciple or theory uiiflerlytnp. (he iiivctinou 

doctiiiiciil of parliciilar relevance; (tie claifited tiiventkm caiin(>i l>e 
consklered novel in cannot Ik: cons'uk'ietl to inv<tlvt," an inventive step 
\\ Ijcti Itie (iotimieni is taken alone 

docuitieul ot particular rtlevaucc; ttie claiiiie<l invention cannot Iv 
coijsidered to involve m invenlive stc|Mvticii ihe docnnient is 
cofiiliined with one or more other .such clocuiiienls, such coiiitiittaiioir 
l)einj.' (tln ious to a tiersou skilled In the art 

docinneiit i)iefiit>ei ol Ilie same p.(leul fatiiil} 



Date of the actual completion of the inter/ialional search 
04 April 2001 (t)4.04.20()l) 


I )a te ^^i^i \^fj^t 


im 


tiional search report 


Name and mailing address t)f the ISA/US 

Coiiifnissktiier of l*aienis and Tradeiiiarks 
iJox 

WashitijMou, 1>.(\ 2(»2^l 
l acsimile No. (703)305 3230 


Authori/ed officer ^ ^ . 
David 11 K ruse / h ^i/ / ( /^^ ^ 
Telephone No. 703 308 () 196 



ronn P("j7iSA/2U) (second sheet) (July 1998) 



INTERNATIONAL SEARCH REPORT 



'"lernational application No. 



Box 1 Observations where certain claims were found unsearchable (Conthuiation of Item 1 of first slicct) 



'] his inlcrnaiional report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons; 

1. Q] C:iaimNos.: 

because they relate to subject matter not required to be searched by this Authority, namely; 



□ 



('kiini Nos.: 

because they relate to parts of the international application that do not comply svith the prescribed requirenK-nts to 
such an extent that no nieaningful international search can be carried out, specifically: 



3. Claim Nos.: 14 and 2.^ 

because they are dependent claims and are noi drafted hi accordance with the second and third sentences of Rule 



6.4(a). 



Hex 11 Observations Nvlicrc unity of invention is lacking (Continuation of Item 2 of first sliect) 



This Inter nalioiuil Searching Authority found multiple inventions in this inlernatitMial application, as follows: 
l^lease vSec (\>iiliiiu:ilion vSiicet 



n 
n 



As all required additional search lees were timely paid by the applicant, this international search report ct)vers ; 
searchable claims. 



As nil searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 

paymcnl of any additional fee. 

As only some of the required additional search fees were timely paid by the applicant, this international search 
report co\ers t>nly those claims for which lees were paid, specifically claims Nos.: 1-H), 13, 25-27 and SMQ ID NO: 
1,2,29.K:30 



No required additional search fees were timely paid by the applicant, (ionsequently, this international search rept>rt 
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on IVotest 




'i'he additional search fees were accompanied by ilie applicant's protest. 
No protest accompanied the payment ol additional search fees. 



l\Mm>('17lSA72l(r"(coi^^^^ of first sheet(l)) (July 1998) 



INTERNATIONAL SEARCH REPORT 



1 



iicrnHiional applicalioii Niv 
P(n7USOO/31325 



BOX II. OBSERVATIONS WIIERP: UNITY OF INVENTION IS LACKING This application contains the following 
inventions or groups of inventions which are not so linked as to form a single general inventive concept under PC'V Rule 13. 1 . In 
order for all inventions to be examined, the appropriate additional cxanunation fees must be paid. 

This application contains the following inventions or groups of inventions which are not so linked as lo form a single general 
inventive concept under I'C'T Rule 13.1. In order for all inventions lo be examined, the appropriate additional examination fees 
must be paid. 

(iroups I-XXllI, claim(s) 1-10, 13, 14 and 25-27, drawn to a transgenic plant having modified structure and development 
characteristics, polynucleotides and vectors for producing said transgenic plant and a method of making said transgenic plant. 
Applicant must elect one pair of sequences (one nucleic acid and the corresponding amino acid translation) to be examined, i.e. 
S\iQ 11) NO; 1 and 2 in Ciroup 1, SI :Q 11) NO: 3 and 4 in (houp II, SliQ ID NO: 5 and 6 ii] (Iroup III, etc. 

(iroup XXIV, claim(s) 1 1 and 12, drawn to an isolated or recombinant polypeptide. 

Ciioup XXV, claim(s) 15-17, drawn lo a n\eUKxl of identifying a factor that is modulated by or inleracls with a polypeptide. 

Oroup XXVI, claim(s) 18, drawn lo a method of identifying a molecule that modulates activity or expression of a polynucleotide 

or polypeptide. 

(iroup XXV 11, claim(s) 19 and 20, drawn to an integrated data system. 

(Jroup XXVIIl, claim(s) 21-24, drawn to a method of identifying a polynucleotide or polypeptide sequence homologue. 

The inventions listed as (iroups I XXVllI do not relate to a single general inventive concept under PC'T Rule 13.1 because, under 
PC'l Rule 13.2, they lack the same or corresponding special technical features for the follou ing reasons: 

The inventions listed as (iroups I-XXVIII do not relate to a single general inventive concept under PCI Rule 13. 1 because, under 
VC'V Rule 13.2, they lack the sanie or corresponding special technical features for the following reasons: (iroups I-XXIIl are 
drawn lo a transgenic plant and a method of producing said plant with a nucleic acid sequence encoding a wide variety o\' 
transcription factors, (iroup XXIV is drawn to a wide variety of isolated or recombinant polypeptides having iranscriptiinial factor 
activity. The methods of Oroups 1-XXlll differ from eacli other in that they are directed lo a plant transformation method and 
transgenic plant with a structurally and functionally distinct nucleic acid sequence which encodes a structurally and functionally 
distinct amino acid sequence. In addition, (iroups XXV, XXVI, XXVI 1 and XXVIll are differeni methods from any c^f (iroups I 
XXIll in thai they have differeni method steps and differeni end products, and (iroup XXVII requires a con\puler system. Thus, 
there is no single special technical feature, which links the inventions of (iroups I-XXVlll under R('T Rule 13.2. 



Contiimation of B. FIELDS SEARClIEn Item 3: i:AS r (llSl'A'l ), STN (A(iRl('()I A, BIOSIS, CAIM .DS, l .MHASH), 
Sequence vSearch S\ Q 11) NO: 1, 2; 29 and 30, N("Hl/(ienhank. 



l orm K"lVlSA/2"u) (extra sheet) (July 1998) 



